The AAAS David and Betty Hamburg Award for 
Science Diplomacy recognizes an individual or 
a limited number of individuals working together 
in the scientific and engineering or foreign affairs 
communities who are making an outstanding 
contribution to furthering science diplomacy. 


Over the past 30 years, AAAS has honored an 
international cadre of science luminaries for their 
contributions to international scientific cooperation 
and science diplomacy. In 2021, the award was 
renamed after David and Betty Hamburg to 
recognize their unparalleled commitment to the 
significant role of science diplomacy to advance 
science, human rights, peace, and cooperation. 


• The award is open to all regardless of 
nationality or citizenship. 
• We accept self-nominations. 
• Nominees must be living at the time 
of their nomination. 


To learn more, visit 


AAAS gratefully acknowledges Carnegie Corporation of New York for their 
generous support to launch the AAAS David and Betty Hamburg Award for 
Science Diplomacy and the individuals and foundations whose contributions 
have begun an endowment that will allow us to sustain it in perpetuity. 
2022 WINNER 
Sir David King, Ph.D. 


Former Chief Scientific Advisor of the 
United Kingdom (2000-2007) 


Honored for his diplomatic stewardship 
towards international consensus for 
urgent global action on climate change. 


Winners will receive: 
Monetary prize of $10,000 
Commemorative plaque 


Worldwide promotion of their achievements 
through AAAS communication channels, 
including AAAS publications, member news, 
website, and social media 


Complimentary registration to the 
2023 AAAS Annual Meeting 


Reimbursement for travel and hotel expenses 
to attend the AAAS Annual Meeting 


The opportunity to publish in 
Science & Diplomacy 


AAAS 


AMERICAN ASSOCIATION FOR 
THE ADVANCEMENT OF SCIENCE 


Your Legacy to Science 


AN ESTATE GIFT TO THE 
AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE 


Since 1848, our founding year, the American Association for the 
Advancement of Science (AAAS) has been deeply committed to 
advancing science, engineering and innovation around the world 


for the benefit of all people. 


By making AAAS a beneficiary of your will, trust, retirement plan or 
life insurance policy, you become a member of our 1848 Society, 
joining Thomas Edison, Alexander Graham Bell and the many 
distinguished individuals whose vision led to the creation of AAAS 


and our world-renowned journal, Science, so many years ago. 


Unlike many of its peers, Science is not for-profit. Your estate gift 
would provide long-term financial stability and durable annual 
income that will support operations and competitive innovation 


for years to come. This support is vital. 


Yes, I would like more information about joining the AAAS 1848 Society. 


PLEASE CONTACT ME AT: 
Name: 

Address: 

City: State: 
Email: 


RETURN THIS FORM TO: 


“As a teacher and instructor, I bear responsibility for the 
younger generations. If you have extra resources, concentrate 
them on organizations, like AAAS, that are doing work for all.” 


—Prof. Elisabeth Ervin-Blankenheim, 1848 Society member 


If you intend to include AAAS in your estate plans, provide this 
information to your lawyer or financial adviser: 


Legal Name: American Association for the Advancement of Science 
Federal Tax ID Number: 53-0196568 
Address: 1200 New York Avenue, NW, Washington, DC 20005 


If you would like more information on making an estate gift to 
AAAS, cut out and return the form below or send an 
email to philanthropy@aaas.org. Additional 
details are also available online at 
www.aaas.org/1848Society. 




Zip code: Country: 


Phone: 


AAAS Office of Philanthropy and Strategic Partnerships • 1200 New York Avenue, NW • Washington, DC 20005 USA 
As a grassroots effort, Bio-protocol is building a community to foster interactions about 
the fine details of methodology in the life sciences. 


Reproducing results lies at the heart of life sciences, but it’s a difficult challenge. “The 
complexity of what we do is so high, and there are so many variables that we try to 
control as much as we can, but that’s the real reason why the reproducibility is so 
hard,” says Ivan Zanoni, associate professor in pediatrics at Harvard Medical School 
and associate editor at Bio-protocol. As a journal—and now a major influencer in the 
scientific world—Bio-protocol is taking on that challenge. 

“It just has to do with the facts of biology,” says Holden Maecker, director of the 
Human Immune Monitoring Center at the Stanford University School of Medicine 
and advisor to Bio-protocol. “It's difficult to know which variables are going to be 
important for a given kind of experiment.” 

As a first step toward clarity and consistency, scientists need to communicate 
more about the precise details of a protocol. As then Science editor-in-chief Marcia 
McNutt wrote in 2014: “A transparent and rigorous approach ... can almost always 
shine a light on issues of reproducibility” (1). The lack of such an approach spawned 
the Bio-protocol concept. 


In 2010, as a postdoctoral researcher at Stanford University, bacterial expert Fanglian 
He started working on a new project, but she struggled to replicate methods from 
published research articles. She recalls, “It was very frustrating to spend time 
troubleshooting a method other people had already used successfully.” In talks 
with colleagues, she learned that they were facing similar problems. So, in 2011, He 
launched Bio-protocol, where she is now the publisher. 

This journal and its programs have a straightforward mission, as described by 
Caroline Shamu, associate dean for research cores and technology at Harvard Medical 
School and editor-in-chief of Bio-protocol: “It’s a place to publish well-documented 
protocols relating to biology.” Those protocols range from everyday methods to the 
latest trends. Plus, Bio-protocol continues the passion that led He to start this journal. 
Looking to help build a community rather than make a profit, Bio-protocol is free to 
read and includes more than 4,000 protocols. 

Image: Bio-protocol journal covers 

Creating such a community depends on a broad-based interdisciplinary team, 
and associate editors make up a key component. In the summer and fall of 2021, 
the Bio-protocol editorial team convened a series of virtual check-in interviews with 
the associate editors. “It was great to see their enthusiasm to make improvements,” 
Shamu says. “They made the effort to attend across many time zones.” 

To further expand Bio-protocol's reach, it has formed strategic collaborations with 
other publications and organizations, including eLife and Science/AAAS. 


Any scientist who ever tried to replicate a protocol knows how difficult it can be 
without adequate information. “Open access to highly detailed protocols is essential 
for research,” says Wolf B. Frommer, Alexander von Humboldt Professor at 
Heinrich-Heine-Universität in Düsseldorf and a member of the Leopoldina, the German National 
Academy of Sciences. “It saves an immense amount of effort and funds if protocols are 
as detailed as they are on the Bio-protocol website, since there are always little tricks 
or pitfalls that can be easily avoided with sufficient information.” 

A 2018 survey conducted by Bio-protocol shows that the plan is working. Of nearly 
6,500 scientists who downloaded protocols between October 2017 and 
January 2018, more than 1,500 responded to the survey. Of the approximately 700 
scientists who had tried the protocols they had downloaded, 91% of them were able to 
reproduce the experiment. This means that scientists can turn to Bio-protocol to find 
tried-and-tested, reproducible methods that can be added to a lab’s toolbox. 

Bio-protocol also runs a sister database called Bio-101, which is an open 
platform for sharing protocols as well as asking and answering questions. “This is 
a very nice platform, which documents all of the everyday basic protocols—such 
as polyacrylamide gel casting, or how to run a protein gel, or basic microscopy 
techniques,” explains Amey Redkar, postdoctoral fellow at the University of Cordoba in 
Spain and associate editor at Bio-protocol. “If someone works under me as an intern, 
it's a good platform to give them the link to browse or search through for the kinds of 
techniques they might need in their projects.” In addition to basic protocols, Bio-101 
has expanded to include preprint protocols describing newly developed methods via 
the “Request a Protocol” initiative (2). 

Sudhir Gopal Tattikota, a postdoctoral researcher in the lab of Norbert Perrimon at 
Harvard Medical School, sees clear benefits for researchers in Bio-protocol's Bio-101 
online features. “Several readers simply approached us via the Bio-101 ‘Request a 
detailed protocol’ option,” he says, “and we were able to immediately send them 
detailed and stepwise protocols, which are sometimes difficult to incorporate within 
research articles.” 

In addition to helping readers, Bio-protocol benefits the authors as well. “It's great 
having a single, shared version of protocols,” says Maecker. “It's made it easy for us to 
refer to those protocols in other papers and easy to share them with fellow scientists.” 

Plus, a Bio-protocol author must think very carefully and precisely about even the 
most basic elements of a method. “The idea is to write a protocol that is as detailed 
as possible—exactly which reagents, which type of plastic, which type of culture 
medium, which type of seal we used,” says Zanoni. “It is very useful, because other 
people who read it can reproduce your data and your experiments, and you will 
be cited.” 


Growing the community 
After more than a decade of working with life scientists around the world, the Bio- 
protocol organization plans to build an even bigger community of scientists dedicated 
to reproducibility. The basis of that work revolves around communication. 

To make interactions even easier for life scientists, Bio-protocol includes a Q&A 
section at the end of each article. But communication can always be improved. “We 
continually strive for improvements that facilitate efficient communication between 
scientists,” He says. 

Improving that communication depends largely on the reach of Bio-protocol. One 
way that the organization hopes to expand its number of users is through its new 
Ambassador Program. As one of the ambassadors, Redkar describes it as a “program 
to promote this platform mainly to younger researchers—especially the undergrads 
who are in the lab for the first time.” He adds, “It is just to make them aware of what 
Bio-protocol is and how it would benefit them in their everyday lab work.” 

The growing participation and engagement of authors and readers confirm 
the beneficial role that Bio-protocol—as an organization as well as a journal—is 
playing in the life sciences. As McNutt added in her editorial, exposing the issues 
of reproducibility “ensures that science moves forward, through independent 
verifications as well as the course corrections that come from refutations and the 
objective examination of the resulting data.” That's just what He hoped for when she 
came up with the idea of Bio-protocol. 

When scientists can reproduce a protocol, it helps everyone. As Zanoni says: “One 
of the best things is when another lab on the other side of the world can reproduce 
what you have done.” 


References 


1. M. McNutt, Science 346, 679 (2014); available at https://www.science.org/doi/10.1126/ 
2. “Request a Protocol (RaP),” Bio-101, Bio-protocol, https://bio-101.net/rap. 
438 News at a glance 


441 Deadly flu spreads through North 
American birds 

As largest ever H5N1 outbreak hits poultry 
and wild species, researchers wonder whether 
virus is here to stay By E. Stokstad 
PERSPECTIVE p. 459 


442 India’s speedy vaccine approvals 
come under fire 

Critics say regulatory agency lacks key 
capabilities and independence By P. Pulla 


444 NYU may hire biologist pushed out 
of MIT 

Med school is in talks with David Sabatini, despite 
sexual misconduct findings By M. Wadman 


445 Nicaragua’s universities stagger 
under government pressure 

Attacks on higher education autonomy put 
research—and researchers—at risk By S. Moutinho 


446 First self-copying mRNA vaccine 
proves itself in pandemic trial 

Twist on current vaccines reduces dose, 
eases distribution By J. Cohen 


447 Math professor’s trial next test of 
China Initiative 

Mingqing Xiao is fighting U.S. charges 
that he failed to disclose ties to Chinese 
institutions By J. Mervis 
448 A quantum sense for dark matter 
By harnessing the strange rules of the 
subatomic realm, quantum sensors 
could solve one of the universe's biggest 
mysteries By A. Cho 

PODCAST 


INSIGHTS 


452 A stark future for ocean life 

Model predicts a mass extinction event 

in the oceans if climate change is uncurbed 
By M.L. Pinsky and A. Fredston 

REPORT p. 524 


453 From outbreaks to endemic disease 
Relief from the effects of epidemics may 
signal the start of low-level disease 
persistence By M. F. Antolin 

REPORT p. 512 


455 A granular approach to electrode 
design 

The consistency of cathode particles plays a 
pivotal role in battery performance By J. Xiao 
REPORT p. 517 


456 Why do animals want what they like? 
As in mammals, honey bee motivation for 
wanting rewards is modulated by dopamine 
By J. E. Garcia and A. G. Dyer 

REPORT p. 508 


457 An adsorbent with flexible 
nanoscopic pores 

Scanning transmission electron microscopy 
shows the adaptive pores of a zeolite 

By T. Willhammar and X. Zou 

RESEARCH ARTICLE p. 491 


459 Resurgence of avian influenza virus 
Unprecedented outbreaks of the H5N1 highly 
pathogenic avian influenza virus raise 
concern By M. Wille and I. G. Barr 

NEWS STORY p. 441 


461 Eugene N. Parker (1927-2022) 
Pioneer of theoretical solar and plasma 
astrophysics By S. D. Bale 


462 A global system for the next 
generation of vaccines 

COVID-19 has shown that hurdles can be 
overcome By N. Arinaminpathy et al. 


466 Landscapes of the Anthropocene 
The late artist Per Kirkeby’s preoccupation 
with geology is on display in a new exhibition 
By D. Dixon 


467 Rethinking the “Western” 
revolution in science 

A historian sees global cultural and 
geopolitical roots in Europe's scientific 
breakthroughs By J. Cañizares-Esguerra 
CREDITS: (TOP) (X-RAY) NASA/CXC/CFA/M. MARKEVITCH ET AL.; (OPTICAL) NASA/STSCI; MAGELLAN/U. ARIZONA/D. CLOWE ET AL.; (LENSING MAP) NASA/STSCI; ESO WFI; MAGELLAN/U. ARIZONA/D. CLOWE ET AL.; (BOTTOM) COURTESY OF MICHAEL WERNER GALLERY, NEW YORK AND LONDON 
LETTERS 


468 Retraction 
By R. L. Reyes et al. 


468 Evolutionary risks of osprey 
translocations 
By F. Monti et al. 


469 How to weaken Russian oil 
and gas strength 
By R. Hausmann et al. 


469 A global plastic treaty must 
cap production 
By M. Bergmann et al. 


470 Errata 
471 From Science and other journals 


RESEARCH ARTICLES 


474 Cancer 

Stepwise-edited, human melanoma models 
reveal mutations’ effect on tumor and 
microenvironment 

E. Hodis et al. 


RESEARCH ARTICLE SUMMARY; FOR FULL TEXT: 
DOI.ORG/10.1126/SCIENCE.ABI8175 


475 Dog genomics 
Ancestry-inclusive dog genomics challenges 
popular breed stereotypes K. Morrill et al. 


RESEARCH ARTICLE SUMMARY; FOR FULL TEXT: 
DOI.ORG/10.1126/SCIENCE.ABK0639 


476 Cancer 

Cancer cells use self-inflicted DNA breaks to 
evade growth limits imposed by genotoxic 
stress B. D. Larsen et al. 


483 Neuroscience 
Molecular and neural basis of pleasant touch 
sensation B. Liu et al. 


491 Zeolites 

In situ imaging of the sorption-induced 
subcell topological flexibility of a rigid zeolite 
framework H. Xiong et al. 

PERSPECTIVE p. 457 


REPORTS 


496 Dynamic genome 

Dynamics of CTCF- and cohesin-mediated 
chromatin looping revealed by live-cell 
imaging M. Gabriele et al. 


502 Hydrogels 

Piezoionic mechanoreceptors: Force-induced 
current generation in hydrogels 

Y. Dobashi et al. 


508 Neuroethology 

Food wanting is mediated by transient 
activation of dopaminergic signaling in the 
honey bee brain J. Huang et al. 
PERSPECTIVE p. 456 


512 Epidemiology 

Rabies shows how scale of transmission 
can enable acute infections to persist at low 
prevalence R. Mancy et al. 

PERSPECTIVE p. 453; PODCAST 


517 Batteries 

Dynamics of particle network in 
composite battery cathodes J. Li et al. 
PERSPECTIVE p. 455 


521 Gravitational waves 

A gamma-ray pulsar timing array 
constrains the nanohertz gravitational 
wave background 

The Fermi-LAT Collaboration 


524 Extinction 

Avoiding ocean mass extinction from climate 
warming J. L. Penn and C. Deutsch 
PERSPECTIVE p. 452 


527 Organic chemistry 

Scaffold hopping by net photochemical 
carbon deletion of azaarenes 

J. Woo et al. 


532 Organic chemistry 
Accelerating reaction generality and 
mechanistic insight through additive 
mapping C. N. Prieto Kullmer et al. 


DEPARTMENTS 


437 Editorial 
Removing a barrier to equity By A. Mapp 


542 Working Life 
Scientist and artist By A. Bashir 


ON THE COVER 


Portraits of pet dogs exemplify the assortment 
of physical characteristics seen in purebred and 
mixed-breed dogs. Additionally, different breeds 
are often thought to have distinctive behaviors 
and personalities. However, data from thousands 
of dogs involved in the Darwin's Ark project reveal 
that this is rarely true, 
and that studying the full 
diversity of dogs is the 
best way to understand 
interactions between 
genetics, ancestry, and 
complex traits such as 
behavior. See page 475. 
Photos: Elke Vogelsang 
EDITORIAL 


Removing a barrier to equity 


Earlier this year, the University of Michigan became 
the first US university to remove the requirement 
that applicants to its nonprofessional 
doctoral programs take a standardized test—the 
Graduate Record Examination (GRE). This decision 
will not, on its own, address inequities in 
admissions practice, nor the broader education 
barriers that many applicants face. But it is a major 
step toward an admissions process that considers all 
dimensions of a candidate’s preparation and promise—a 
holistic view that should be adopted by all universities 
if equity in education and opportunities is to 
be achieved. 

The stated goal of the GRE general test is to assess 
writing skills and quantitative and verbal reasoning. 
Its actual value as an instru- 
ment for determining who is 
likely to be successful in a doc- 
toral graduate program, how- 
ever, has not been established. 
There is a positive correlation 
across disciplines between GRE 
scores and first-year gradu- 
ate course GPA (grade point 
average). However, although 
coursework is part of obtaining 
foundational knowledge in a 
field, the central focus of a doc- 
toral degree comprises intellec- 
tual contributions and research 
accomplishments (measured in many dimensions, 
such as publications, presentations, and patents). 

Beyond major questions about GRE utility, there are 
substantive financial and opportunity costs associated 
with requiring the GRE: costs for applicants and costs 
for admissions committees. For the potential applicant, 
the test is expensive; registration alone has a price tag 
of US$205, with additional fees for score reporting. (A 
fee reduction is possible for qualified applicants.) Many 
feel pressure to enroll in expensive GRE prep courses that 
teach test-taking strategies and often offer money-back 
guarantees for competitive scores and score improve- 
ments. The GRE 162+ course offered by Princeton Review, 
for example, guarantees scores of at least 162 (out of 170) 
at a cost of US$2149. 

What are the costs for admissions committees that use 
the GRE in admissions decisions? In short, the loss of tal- 
ented applicants at every stage of the process. Students 
who do not have the financial means to prepare for and/ 
or take the test will not apply. A further loss comes from 
potential applicants who lack access to the test, either because 
of the limited availability of physical testing sites or 
because of lack of access to wireless networks needed for 
online test-taking options. Once applications have arrived 
for review by a doctoral admissions committee, use of the 
GRE can lead to additional loss of talented applicants. 
Despite recommendations that GRE scores only be used 
in the context of an overall evaluation of an applicant, in 
practice they can readily be employed—implicitly or ex- 
plicitly—as cutoffs for further consideration of an applica- 
tion. This is an especially problematic practice given that 
GRE scores consistently correlate with the sex, ethnicity, 
socioeconomic status, and race of the test taker. 

Discontinuing the use of the GRE in doctoral admis- 
sions is, therefore, a valuable step toward best practices 
to reduce the barriers—systemic and otherwise—faced by 
students, to decrease bias in the 
evaluation process, and to sup- 
port admission and recruitment 
of outstanding student cohorts. A 
concern is that in the absence of 
the GRE score, admissions com- 
mittees will rely more heavily on 
other aspects of a candidate's ap- 
plication that—like the GRE—are 
strongly correlated with race, sex, 
and socioeconomic factors rather 
than talent and promise. This 
includes undergraduate institu- 
tion ranking, prestigious intern- 
ships, and performance on other 
standardized tests. It is for exactly this reason that frank 
conversations about discordance between the stated goals 
and values of doctoral programs and the common prac- 
tices in doctoral admissions are indispensable. 

In recent years, many individual graduate programs 
have removed the GRE requirement, with notable suc- 
cess. But implementing a truly holistic admissions pro- 
cess at the university level requires more than dropping 
standardized tests. Faculty need education and institu- 
tional support to develop effective and equitable evalu- 
ation rubrics and to learn how to better use the rich 
information in graduate applications rather than single 
data points in their evaluative processes. The continual 
collection and analysis of data surrounding all aspects of 
the student life cycle—from recruitment through gradu- 
ation—are also critical. In this way, changes across many 
dimensions that correlate with GRE general test discon- 
tinuation and evolving holistic admissions practices can 
be identified and can inform future goals for the graduate 
education community at large. 

-Anna Mapp 
“The reality is that there is a loss of momentum.” 


Isaac Adewole, a physician and consultant for the Africa Centres for Disease Control and 
Prevention, in The New York Times, on data showing the world is falling far short of a goal of vaccinating 
70% of adults against COVID-19. Developing countries are more likely to have low rates. 
A cliff marks the edge of an ancient delta, where the Perseverance rover will collect rocks for testing. 


PLANETARY SCIENCE 


Rover reaches Mars river delta 


NASA’s Perseverance rover last week reached the primary target 
of its mission, an ancient river delta, a fan-shaped collection of 
rocks and sediments, frozen in time billions of years ago, that 
spills from the rim of Jezero crater to its floor. Operators plan to 
comb the fossilized delta sediments for rocks that could contain 
organic molecules indicative of past martian life. The $2.7 billion 
rover landed on Mars in February 2021, studying volcanic rocks on the 
crater floor for a year before trundling to the delta, which ends in a 
cliff face at the crater’s rim. The rover’s team plans to drill and collect 
approximately 30 rock samples, which are expected to be returned to 
Earth in future missions that NASA and the European Space Agency 
plan to launch in 2028. 


Atom smasher powers up again 


PARTICLE PHYSICS | The world’s largest 
atom smasher, the Large Hadron 
Collider (LHC) at CERN, the European 
particle physics laboratory, started up 
last week after more than 3 years of 
upgrades. The 27-kilometer-long accelera- 
tor famously blasted out the long-sought 
Higgs boson in 2012, and it last took data 
in December 2018. Since then, technicians 
have improved the accelerators that feed 
protons into countercirculating beams 
and the four large detectors that study the 
resulting proton collisions. For the next 
few months, researchers will work out 
the bugs and slowly increase the beams’ 
energy. Data recording—and the search 
for new and unexpected particles—should 
begin this summer. In 2025, the LHC will 
shut down for an even more extensive 
upgrade to greatly increase the intensity of 
its beams, which would increase the rate 
of proton collisions. 


Biden resets environmental rules 


POLICY | Reversing decisions made by 
former President Donald Trump, the 
administration of President Joe Biden last 
week released new guidelines expanding 
the technical issues that must be assessed 
in environmental reviews of a wide range 
of federally funded activities, including 
major construction projects. Trump 
had narrowed the scope of the reviews 
required by the National Environmental 
Policy Act (NEPA). For example, federal 
agencies were directed to consider only 
the direct, near-term effects of an activity, 
meaning they could ignore longer term 
issues, such as a project’s impact on future 
climate change. The Biden administration 
says it plans to release a second set of 
NEPA guideline revisions that will 
address other controversial alterations 
made by Trump, including tight time 
limits on reviews. 


Restore degraded land, U.N. urges 


SUSTAINABILITY | Reversing global land 
degradation can help alleviate three big 
problems—the effects of climate change, 
biodiversity loss, and food insecurity, 
a U.N. report says. The Global Land 
Outlook 2, released on 27 April by the 
United Nations Convention to Combat 
Desertification, quantifies the toll taken 
by development, deforestation, agricul- 
ture, and other human activities on soil 
health. If current trends continue, the 
area of degraded land will equal the size 
of South America and global crop and 
ecosystem productivity will drop as much 
as 14% by 2050. Sub-Saharan Africa will 
fare the worst. But through steps such 
as planting trees and stabilizing erosion 
on 5 billion hectares, countries can 
slow this decline, increase crop yields, 
and lock up more atmospheric carbon in 
soils. Already more than 100 nations have 
pledged to restore 1 billion hectares by 
2030. Redirecting money spent on fossil 
fuel and farming subsidies can help pay 
for restoration, the report says. It says restoration 
means not just creating protected 
wildlands, but also taking active steps 
to enable land to support farming, tree 
plantations, and grazing. 
PARTICLE PHYSICS 


Fermilab neutrino source gets green light for construction 


Fermi National Accelerator Laboratory (Fermilab) last week 
won final approval to build a new superconducting linear 
accelerator that will generate an intense beam of protons 
with an energy of 800 million electron volts. Researchers will 
use that beam to create elusive particles called neutrinos and 
shoot them 1300 kilometers through Earth at a planned under- 
ground detector in South Dakota for high-priority experiments. 


The new accelerator, the Proton Improvement Plan-II, replaces 
an older one with half the energy. Workers had already begun to 
construct buildings for the new machine; last week's go-ahead 
from the U.S. Department of Energy lets the lab build the actual 
accelerator. All told, the project will cost $978 million, not includ- 
ing equipment contributions from other countries totaling roughly 
$330 million. Completion is expected in 2032. 


A new U.S. accelerator will speed up particles using a series of these five-cell, superconducting, radio-frequency cavities. 


Vikings sold ivory in Ukraine 


ARCHAEOLOGY | Vikings shipped walrus 
ivory from Greenland far away to Kyiv, 
according to new analyses of ancient walrus 
skulls and ivory figurines discovered in the 
Ukrainian capital. The 4000-kilometer trade 
route is much longer than was thought 
based on previous studies, which suggested 
eastern users of the coveted material 
sourced it only from walruses caught in 
the Russian arctic. After discovering fin- 
ished ivory artifacts and discarded walrus 
skull fragments in a Kyiv layer dating to 
the 1100s C.E., an international 
team used DNA and chemical 
analyses to link the material 
to a genetic group of walruses 
found only in the western 
Atlantic Ocean. Vikings 
appear to have overhunted 
the Greenlandic walruses, 
and the authors say consumer 
demand in Eastern 
Europe may have been 
responsible. A decline in harvest could also 
help explain why Norse settlers abandoned 
Greenland in the 1300s. 


Walrus ivory, used in this carved 
specimen from Norway, was widely 
prized in early medieval Europe. 


Asteroid explorer gets 2nd target 


PLANETARY SCIENCE | Next year, after 
swinging past Earth and dropping off a 
cargo of rocks it collected from the asteroid 
Bennu, NASA’s OSIRIS-REx spacecraft will 
embark on a new mission to the asteroid 
Apophis, the agency said this week. In 2029, 
the 300-meter-wide Apophis will zip past 
Earth in the closest flyby of a large asteroid 
in modern records; at one-tenth the distance 
to the Moon, it will be visible to the naked 
eye. Afterward, the renamed OSIRIS-Apex 
mission will visit and collect data from the 
asteroid for 18 months. Although the space- 
craft cannot collect any more rocks, it will 
gauge how Apophis’s orbit was perturbed 
by Earth’s gravity and conduct an approach 
maneuver to scorch off the asteroid’s surface 
layer, exposing hidden layers below. 


Most in U.S. show signs of infection 


COVID-19 | Nearly three in five U.S. residents 
had been infected with SARS-CoV-2 
by February, a sign of the speed with which 
the highly contagious Omicron variant 
surged, the Centers for Disease Control and 
Prevention reported this week. Starting in 
September 2021, researchers analyzed tens 
of thousands of blood samples collected 
each month, measuring antibodies that 
are generated by exposure to the virus but 
not by the vaccines available in the United 
States. From December 2021 to February, 
as Omicron took off, the portion of samples 
with the antibodies rose from 34% to 58% 
across all age groups. In children and ado- 
lescents, the least vaccinated age groups, 
the proportion rose from 45% to 75%, the 
researchers reported on 26 April in the 
Morbidity and Mortality Weekly Report. 
Other studies have shown that COVID-19 
vaccines protect most recipients against 
severe illness, but effectiveness against 
infection, especially by Omicron, wanes. 


Avoidable COVID-19 deaths tallied 


PUBLIC HEALTH | About 234,000 U.S.
deaths from COVID-19 since June 2021 could
have been prevented if all eligible adults had
received the primary series of vaccinations,
an analysis by the Kaiser Family Foundation
estimated. The toll represents 60% of all
adult COVID-19 deaths since then and one-quarter
of the nearly 1 million U.S. deaths
from COVID-19 since 2020.


Pandemic cut kids’ vax rates


INFECTIOUS DISEASES | COVID-19
drove down the rate at which U.S.
kindergartners received three routine
vaccinations during the 2020-21 school
year, the Centers for Disease Control and
Prevention said last week. In 47 states,
percentages of those receiving the diphtheria,
tetanus, and acellular pertussis
vaccine; the measles, mumps, and rubella
vaccine; and the varicella (chickenpox)
vaccine fell to about 94%, just below the
95% mark considered necessary to maintain
populationwide immunity. The
agency blamed missed checkups because
of the pandemic.


Ukraine’s fallen include scientists


CONFLICT | As the war in Ukraine grinds
into a third month, deaths are mounting—
and scholars young and old are among
the casualties. Russian soldiers have
gunned down scientists in cold blood,
including Vasyl Kladko, an x-ray
crystallographer at the V.E. Lashkaryov
Institute of Semiconductor Physics, slain
last month in a Kyiv suburb. In Kharkiv,
scientists have died in rocket attacks.
And in early April, Andriy Kravchenko, a
chemist at the Chuiko Institute of Surface
Chemistry, died when a land mine shredded
his car near Kyiv. A few days earlier, his
team delivered to a hospital the first batch
of something he’d spent years developing
for battlefield use: a topical coagulant
that stanches bleeding until a doctor can
reach an injured soldier. “He dreamed it
would appear in the first-aid kit of every
Ukrainian soldier,” says Chuiko colleague
Mariia Galaburda. “Such a heavy and
painful loss.”


Shelled, shot, bombed

This partial list of Ukrainian researchers killed, based
on Science reporting, shows the war’s toll on civilians.

NAME                 SPECIALTY            CAUSE OF DEATH
Oleksandr Korsun     Chemistry            Shelling
Oleg Amosov          Economics            Shelling
Yulia Zdanovska      Math                 Shelling
Yevhen Khrykov       Education research   Shot
Vasyl Kladko         Physics              Shot
Andriy Kravchenko    Chemistry            Land mine


CONSERVATION 
Many reptile species face risk of extinction 


Half of turtle and crocodile species are in danger, says an extensive global review of
reptiles. Experts evaluated data for 10,196 species in 10 major taxonomic groups
of reptiles and assessed their conservation status. Hunting and fishing contributed
to the high figure for turtle species, the research team reports this week in Nature.
Overall, 18% of reptile species are threatened; the figure rose to 21% when the
scientists extrapolated to include data-deficient species. Reptiles are less at risk than
amphibians (41%) and mammals (25%) but worse off than birds (14%).


Grim reckoning 
In some orders of reptiles, more than 50% of species were deemed threatened, grouped in one of three 
categories of risk used by the International Union for Conservation of Nature. (Some orders are not shown.) 


[Bar chart: % of species by conservation status (critically endangered, endangered, vulnerable, other) for Testudines (tortoises, terrapins, and other turtles); Crocodylia (crocodiles, alligators, etc.); Iguania (iguanas, chameleons, etc.); Serpentes (snakes, suborder of Squamata); and all reptiles.]
U.K. funder regrets IDing critics


WORKPLACE | The funding agency UK 
Research and Innovation last week apolo- 
gized for encouraging a UKRI-linked analytics 
firm to report critical comments made by 
academics—a move researchers said was a 
threat to their academic freedom. Scholars 
had tweeted criticisms about the Researchfish 
software platform, which collects data for 
UKRI on the impact of its research grants; 
the platform then told them it had shared its 
concerns about the tweet with the research- 
ers’ funders. In its apology statement, UKRI 
said Researchfish had flagged six tweets
over 4 years. The agency called its approach
“wrong” and promised to stop it. “At no point 
was this ever intended, or used, to affect cur- 
rent or future grants from UKRI.” 


Swedish pandemic leader stays put 


NEWSMAKERS | Anders Tegnell, former chief 
epidemiologist at Sweden’s Public Health 
Agency who drew controversy over his coun- 
try’s subdued response to COVID-19 (Science, 
9 October 2020, p. 159), will remain with the 
agency after a reported new position consult- 
ing for the World Health Organization on 
pandemic vaccinations didn’t materialize. “An 
agreement could not be reached,” the Swedish 
agency said last week, adding that Tegnell will 
now work on international issues. 


Prenatal tests get FDA warning 


GENETICS | Screening evaluations that ana- 
lyze blood from pregnant people to look for 
abnormalities in fetal DNA can be unreliable, 
the U.S. Food and Drug Administration (FDA) 
warned last week. The tests, marketed in
the United States by several companies, are
less invasive than diagnostic tests that use 
amniotic fluid or placenta samples, which in 
rare cases cause miscarriage. But the blood 
tests have often incorrectly indicated rare 
genetic conditions, such as those caused by 
small missing pieces of chromosomes, FDA 
said on 19 April. It noted, for example, that 
follow-up tests fail to confirm seven of 10 
positive results for a rare genetic condition 
called DiGeorge syndrome. Nevertheless, 
some parents have chosen to end pregnan- 
cies based on these results alone, it said. FDA 
doesn’t regulate the tests, and most makers 
of microdeletion tests don’t publicize usage, 
The New York Times reported in January. 
FDA recommends patients consult a health 
care provider before deciding to take prenatal 
blood tests and to confirm any positive result 
with further tests. 
Deadly flu spreads through
North American birds 


As largest ever H5N1 outbreak hits poultry and wild 
species, researchers wonder whether virus is here to stay 


By Erik Stokstad 


When black vultures began to die
at Florida’s Hontoon Island State
Park in February, rangers called in
investigators from the state’s Fish
and Wildlife Conservation Commission.
They soon concluded
a virus that has devastated domesticated
birds worldwide had reached the vultures:
a strain of highly pathogenic avian influenza
(HPAI) known as H5N1. The vultures
had likely acquired the virus from eating
infected waterbirds—as well as by cannibalizing
their own kind.

Workers removed more than 200 carcasses 
in a bid to contain the outbreak. But Mark 
Cunningham, a wildlife veterinarian with the 
commission, thinks the effort was probably 
futile. “It’s hard to see this chain of infection
really breaking anytime soon,” he says. 

That’s a fear shared by researchers and 
poultry farmers across North America, who 
in recent weeks have been urgently docu- 
menting and trying to contain the continent’s 
largest outbreak of HPAI (p. 459). Since the
virus was first spotted in eastern Canada in 
November 2021, it has been spreading across 
the continent with migrating waterfowl (see 
map, p. 442). Poultry farmers have killed 
nearly 33 million chickens and turkeys in a 
bid to save other flocks and curb economic 
losses. Meanwhile, the virus has killed an un- 
told number of wild birds; researchers have 
so far documented infections in 51 species, 
including bald eagles and great horned owls. 
That’s more than twice the number of species 
known to have been infected during the last 
North American HPAI outbreak, in 2014-15. 

HPAI can be far deadlier to birds than 
seasonal flus are to people, and each out- 
break stirs fears about human infection. 
The current wave has produced no known 
human cases in North America, however, 
much to the relief of public health experts 
already battling COVID-19. 

Still, the scope of the HPAI outbreak 
“boggles the mind,” says disease ecologist 
Nichola Hill of the University of Massachusetts,
Boston. She’s one of many researchers
scrambling to understand how the virus
might spread to mammals and whether it
will hang on indefinitely in North America,
as it has in Europe and Asia. “It’s everyone
on board, at max capacity,” says Susan
Shriner, an ornithologist at the U.S. Department
of Agriculture, which is helping coordinate
the research effort.

Despite treatment, this bald eagle in Wisconsin
became one of many wild birds killed by avian flu.

The most important HPAI lineage, part 
of the H5 group of viruses, arose in the 
late 1990s in domestic geese in Asia. Soon 
it reassorted with flu strains found in wild 
waterbirds. In poultry, infections cause 
pneumonia, seizures, and hemorrhaging 
with mortality rates of up to 100%. Further 
mutations enabled those early waves of H5 
viruses to infect people—they have killed 
more than 456 since 2003—raising fears
that the viruses could cause a pandemic. 
But so far, they have not gained the ability 
to readily spread from person to person. 

The H5 viruses did, however, cause cat- 
astrophic losses of poultry in Southeast 
Asia. And migratory birds carried the H5N1 
strain out of Asia, first to Europe, where it 
killed an array of water birds, predatory 
birds, and scavengers such as buzzards. 
During the earliest outbreaks, the risk was 
highest during peak fall migration, when 
waterfowl arrived in Europe. But in the past 
2 years, the virus has become endemic in 
Europe, present at some level year-round 
in wild birds. The virus “is not something 
that is going to go away anytime soon,” says
Arjan Stegeman, a veterinary epidemio- 
logist at Utrecht University. 

Because of the persistence of the 
virus—and the emergence of an apparently 
more pathogenic strain of H5N1—Europe 
has been experiencing ever-worsening 
HPAI outbreaks in both domestic and wild 
flocks. Farmers have had to undertake 
massive culls, and producers of free-range 
poultry have been forced to move their 
flocks indoors. Sixty-two wild species have 
been found infected in Europe and the 
Middle East in the past 4 months, with 
some—including barnacle geese, Dalmatian 
pelicans, and common cranes in Israel— 
suffering worrisome losses. 

In North America, officials have been keeping
a wary eye on H5N1. In 2014, migrating
birds brought a related virus, H5N8, from
Asia to the U.S. Pacific Northwest, sparking
an outbreak that ultimately caused U.S.
farmers in 15 states to kill some 50 million 
chickens and turkeys and tally $3 billion in 
losses. This time, H5N1 appears to have ar- 
rived from Europe. Last year, after surveys 
found the strain circulating at high levels 
among wild birds in Western Europe, U.S. 
officials increased their vigilance and requested
funds to sample more waterbirds
killed by hunters along the Atlantic and Pa- 
cific flyways. 

In December 2021, several hundred birds 
died at a petting farm in Newfoundland and 
tested positive. The next month tests showed 
a duck killed by a hunter in South Carolina 
was carrying H5N1. By February, the virus
had reached the confluence of the Ohio and 
Mississippi rivers, where the first farm re- 
ported an infection. Since then, migratory 
birds have spread the virus into the Missouri 
River Basin and the Great Plains. 

Researchers have not yet tested the
transmissibility of the virus, but they suspect
it spreads more easily than previous
strains. That would mean a higher proportion
of migratory birds get infected, the
geographic spread is wider, and there’s
a higher prevalence in waterbirds—and
hence more spillover into poultry and wild
birds, says Bryan Richards, emerging disease
coordinator at the U.S. Geological Survey’s
National Wildlife Health Center.

Genetic analyses of the virus suggest
introductions to farms are coming primarily
from nearby wild birds. In contrast, researchers
believe that during the 2014-15
outbreak of H5N8 humans often accidentally
moved the virus from farm to farm.

To prevent infections, many zoos have
moved their captive birds indoors or away
from visitors. It’s much harder to protect
wild birds, however, raising fears that the
virus could threaten endangered species,
especially those with small populations. So
far, however, the number of detected infections
in wild populations is relatively low,
so researchers are “cautiously optimistic ...
that we will not see tremendous impacts,”
Richards says.

Just two songbird species have tested positive:
blue jays and crows. At greater risk are
water birds, especially those that form dense
nesting colonies, and birds that prey on them.
Bald eagles often hunt ducks and geese, and
some have apparently infected their young
by feeding them virus-laden prey, says
Rebecca Poulson, a wildlife disease researcher
with the Southeastern Cooperative
Wildlife Disease Study at the University of
Georgia. As of 26 April, 48 eagles known to
be infected had died. The eagle deaths—and
those of less charismatic birds—are “really
heartbreaking,” she says. “We’ve just had our
heads hung low some days.”

In 2015, Richards says, infections among
wild birds petered out in summer as they
moved north and dispersed across their
nesting grounds, and as lakes and wetlands
warmed, creating conditions inhospitable
for the virus. This summer, “Knock on
wood, we should see a substantial waning
of impacts in backyard flocks and commercial
operations as well,” he says.

A fowl plague

H5N1, a highly pathogenic avian flu virus, arrived from Europe in late 2021. Waterfowl have since spread
the virus to other species, including poultry, forcing farmers to kill millions of chickens and turkeys.

But the threat could re-emerge when birds 
start to migrate south in September. To help 
farmers stay on guard, the United States will 
nearly double surveillance efforts. One fear, 
says Thijs Kuiken, an avian influenza expert 
at Erasmus University Rotterdam, is that 
H5N1 will spread south of the U.S. border. 
Farms there are likely to be more vulnerable. 

“People in Central America and South 
America really need to be aware,” he says,
“that this virus is likely to arrive on their 
doorstep—if it hasn’t already.” 


India’s speedy 
vaccine 
approvals come 
under fire 


Critics say regulatory 
agency lacks key capabilities 
and independence 


By Priyanka Pulla 


A COVID-19 vaccine named Corbevax
looked like a triumph for India’s
burgeoning drug industry. Because
its U.S. developers hadn’t claimed a
patent on it, an Indian manufacturer
named Biological E was able to sell
the two-dose protein-based vaccine to the
government at the extraordinarily low price
of 145 rupees ($1.90) per dose. In March,
the country began to give the shots to 12- to
14-year-olds, a group for which India did not
yet have a licensed COVID-19 vaccine.

But the celebration was quickly drowned 
out by questions over whether India’s drug 
regulator, the Central Drugs Standard Con- 
trol Organization (CDSCO), had properly 
vetted the vaccine. 

In February, CDSCO had authorized the 
use of Corbevax for adolescents ages 12 to 18. 
But within weeks, the Indian media outlet 
The Wire Science revealed that the National 
Technical Advisory Group on Immunisation 
(NTAGI), an expert group that advises the 
health ministry on which vaccines to add 
to the national immunization program, had 
questioned whether Biological E had shown 
the vaccine is effective. In adolescents, who 
are at a lower risk of severe COVID-19, the 
benefits of a vaccine should be beyond any 
doubt, NTAGI member Jayaprakash Muliyil 
tells Science: “Anytime you vaccinate chil- 
dren, you have to be extremely careful.” 

Other CDSCO approvals of COVID-19 vac- 
cines have raised questions as well, both from 
NTAGI and independent experts. The agency 
has used “suboptimal” standards on several 
occasions, says Vineeta Bal, an immunologist 
at India’s National Institute of Immunology. 
That has led some scientists to ask whether 
the agency has the capabilities—and is in- 
dependent enough—to oversee the quality 
of medicines for India’s 1.4 billion people. 
The implications go beyond India, because 
the country is a major global medicine supplier.
The World Health Organization has
“prequalified” 54 vaccines produced in India 
for use elsewhere, and WHO relies on CDSCO 
to oversee the manufacturers. 

CDSCO didn’t respond to questions from 
Science about the criticism. In May 2020, In- 
dia’s health ministry appointed a committee 
to advise it on how to restructure India’s drug 
regulatory system in line with global best 
practices, but that committee’s recommen- 
dations haven’t been published. It’s unclear 
whether they will address vaccine regulation. 

CDSCO has a decent reputation: Based on 
an extensive assessment, WHO concluded in 
2017 that it was a “functional” drug regula- 
tor, a distinction only 30% of its counterparts 
around the world enjoy. (It ranks a step be- 
low 11 agencies WHO credits with a “high 
level of performance,” however, including the
U.S. Food and Drug Administration and the
European Medicines Agency.) 

But the pandemic has been challenging 
for CDSCO, says Gagandeep Kang, a public 
health microbiologist at Christian Medical 
College, Vellore. For the first time, Indian 
manufacturers developed new vaccines 
rather than me-too versions of shots ap- 
proved elsewhere, and the agency lacked 
some types of expertise, Kang says, such 
as the ability to reanalyze data from key 
studies. Others suspect CDSCO’s decisions 
have been skewed by political pressure to 
quickly approve made-in-India vaccines, 
which have been a point of pride for the 
ruling Bharatiya Janata Party. “The gov- 
ernment has a keenness of publicity, and 
that has certainly created pressure,” says T. 
Sundararaman, an independent public
health analyst based in Puducherry. 

In January 2021, for example, the agency 
greenlit Covaxin, an inactivated-virus vaccine 
produced by Bharat Biotech, without data 
from large-scale efficacy trials—only phase 
2 data about the immune response gener- 
ated by the vaccine. By the time the company 
published data showing 78% efficacy against 
symptomatic COVID-19, 6 months later, mil- 
lions of Indians had already received the shot. 

NTAGI also differed with CDSCO’s as- 
sessment when the regulator approved a 
COVID-19 vaccine named ZyCoV-D for use 
in both adults and adolescents in August 
2021. Produced by Zydus Cadila in Gujarat 
state, ZyCoV-D is the first DNA vaccine ap- 
proved by any country for use in humans. 
CDSCO based its decision on results of a trial 
in about 28,000 participants over 12 years of 
age, which found the vaccine 67% efficacious 
at preventing symptomatic COVID-19. 

NTAGI, which does not typically make 
its advice public, opposed the use in ado- 
lescents, Muliyil says; it felt a completely 
new vaccine platform should only be used 
in adults at first. Moreover, the phase 3 trial 
had a single efficacy estimate for all ages, 
says another NTAGI member who asked not
to be identified, even though efficacy can 
differ by age group. (In Western countries, 
COVID-19 vaccines for adolescents and chil- 
dren were authorized after separate trials in 
those age groups.) 

In March, CDSCO’s reputation took another
hit when a WHO inspection of the Covaxin
manufacturing facility in Hyderabad
found quality control deficiencies, whose
nature WHO has not disclosed. WHO recommended
that member countries stop
using the vaccine, and Bharat voluntarily
halted exports. But the company has downplayed
the problems and says it will keep
selling Covaxin in India. CDSCO did not respond
to questions from Science about the
problems or why it failed to spot them. “It
concerns me that CDSCO, the custodian of
public health as India’s national drug regulator,
haven’t issued any statements yet on
this issue,” says Jayanthi Vuppala, an independent
expert on good manufacturing
practice based in Hyderabad.

A girl in Kolkata receives a dose of Corbevax, which
India began to use in 12- to 14-year-olds in March.

February’s approval of Corbevax—
which by now has been given to 30 million 
adolescents—raised more questions. CDSCO 
authorized the vaccine for 12- to 18-year-olds 
based on interim data from a 312-participant 
study that showed the vaccine triggered a rise 
in neutralizing antibodies. But NTAGI wasn’t 
convinced the vaccine was entirely respon- 
sible for the rise, Muliyil says. Data from the 
unvaccinated placebo group could have shed 
light on whether COVID-19 infections were 
also contributing, but as a preprint posted on 
26 April shows, the trial did not assess anti- 
bodies in the placebo group. Biological E did 
not respond to a question about the data. 

Since then Botswana has also licensed Cor- 
bevax. And on 21 April, an expert committee 
advising CDSCO recommended the shot’s ap- 
proval for 5- to 12-year-olds in India. 

Critics applaud the fact that Corbevax is 
free from patents, but they worry its approval 
signals a double standard. “[Would] this vac- 
cine be acceptable for pediatric populations 
in a high-income country with the data we 
currently have in the public domain?” asks 
Boghuma Kabisen Titanji, an infectious dis- 
ease specialist at Emory University. 

Corbevax co-developer Peter Hotez, who 
leads the Texas Children’s Hospital Center 
for Vaccine Development, says Indian com- 
panies have a strong track record for pro- 
ducing WHO-prequalified vaccines. “I’m not 
aware they apply lower bars and standards,” 
Hotez says. “It would be unfair to think so 
and in fact it reflects colonial attitudes.” 

But the controversy has led some Indian 
pediatricians to not recommend the vac- 
cine. And hasty vaccine approvals, espe- 
cially for children and adolescents, pose a 
risk that goes beyond the individual vac- 
cine, Titanji cautions: “It sets a dangerous 
precedent which could be very damaging 
for vaccine confidence and uptake down 
the road.”


Priyanka Pulla is a science journalist 
in Bengaluru, India. 
SCIENTIFIC COMMUNITY


NYU may hire biologist pushed out of MIT 


Med school is in talks with David Sabatini, despite sexual misconduct findings 


By Meredith Wadman 


The New York University (NYU) Grossman
School of Medicine is in discussions
with biologist David Sabatini
about hiring him as a faculty mem- 
ber, according to multiple sources 
at the school. Since August 2021, 
Sabatini has been forced out of or fired from 
three leading institutions for sexual harass- 
ment or for violating workplace or consen- 
sual sexual relationship policies. The hiring 
discussions have been taking shape over 
several weeks, according to NYU sources. 

“Any decision about a potential role [for 
Sabatini] at NYU Grossman School of 
Medicine would be subject to careful 
and extensive due diligence and con- 
sultation with a broad group of stakeholders,”
the school said in a state-
ment this week. 

“David defers to NYU on this pro- 
cess,” a Sabatini spokesperson said. 

In an email to faculty and staff last 
week, the medical school’s dean de- 
cried what he called “cancel culture.” 
And last month, a prominent philan- 
thropist defended Sabatini to a crowd 
of elite researchers in New York City. 

But numerous faculty members at 
NYU’s medical school are disturbed 
by the prospect of hiring Sabatini, 
saying the move sends a message 
of tolerance for sexual misconduct 
and hostile lab environments. “We 
are deeply concerned that recruiting 
an individual found guilty of having bro- 
ken the policies of prestigious institutions 
such as MIT [the Massachusetts Institute of 
Technology], the Whitehead Institute and 
HHMI [the Howard Hughes Medical Insti- 
tute] may profoundly damage our culture 
and reputation,” reads a letter to the dean 
and vice dean that had 66 faculty signatures 
as of midafternoon Tuesday. 

“I’m concerned for the culture of the institution,”
said one faculty member, who
asked to be anonymous for fear of retalia- 
tion by administrators. “The accusations 
made are really serious. ... Is this signaling 
that in the future students that come for- 
ward for any kind of faculty misconduct 
will be disregarded?” 

However, some current and former 
Sabatini lab members are enlisting anonymous
signatures for an online letter supporting
him. The letter says the signers “never
experienced or observed an abusive lab cul- 
ture or a sexualized lab environment, and we 
did not witness sexual harassment. ... Work- 
ing in David’s lab was one of the most influ- 
ential and fortunate events of our careers. ... 
We hope that this letter will contribute posi- 
tively in your decision to hire David.” 

Sabatini, a prominent cancer biologist,
resigned from MIT earlier this month after
senior officials recommended revoking his
tenure. They found he had violated MIT’s
policy on consensual sexual relationships
and expressed “significant concerns regarding
his professional behavior to some lab
members,” President L. Rafael Reif wrote to
MIT faculty on 1 April.

Cancer biologist David Sabatini, pictured at a 2019 awards dinner.

In August 2021, Sabatini was forced out 
of the Whitehead. An outside law firm 
had “found that Dr. Sabatini violated the 
Institute’s policies on sexual harassment, 
among other Whitehead policies unrelated 
to research misconduct,” Director Ruth
Lehmann wrote to Whitehead staff. HHMI, 
which had funded Sabatini’s Whitehead lab, 
fired him at the same time, saying he had 
violated workplace behavior policies. 

The institutions have not provided 
details about the behaviors that led to 
Sabatini’s removal, but the Whitehead com- 
piled a 229-page report, according to a def- 
amation lawsuit Sabatini filed in October 
2021 against the Whitehead, Lehmann, and 
a junior colleague who told investigators 
that Sabatini had sexually harassed her. He
had served on her Ph.D. thesis committee,
and she became a Whitehead fellow with her 
own lab in a program he was soon appointed 
to direct. She filed a counterclaim against 
Sabatini in December, arguing his law- 
suit was frivolous and retaliatory. (Science 
does not name targets of alleged sexual mis- 
conduct without their permission.) 

Sabatini’s hire may have support from 
a benefactor who could make up some of 
the funding Sabatini lost when HHMI 
fired him. Billionaire hedge fund founder 
Bill Ackman of Pershing Square Capital 
Management supports coveted awards for 
early-career cancer researchers. Sabatini is 
a reviewer for those awards and at- 
tended a 1 March Pershing Square 
Foundation dinner at Manhattan’s Le 
Bernardin restaurant. In his remarks, 
Ackman bemoaned what he called 
Sabatini’s unfair treatment and said 
sidelining Sabatini’s work would be 
detrimental to the field. (Ackman was 
not available for comment.) 

On 21 April, NYU medical school 
dean and CEO Robert Grossman sent 
his monthly email to medical school 
faculty and staff, titling it “Civility 
Rules.” “A mob feels compelled to stri- 
dently ‘cancel’ someone with different 
thoughts, or to baselessly attack an in- 
dividual in ways that can be difficult 
to disprove,” he wrote. “[This is] tol- 
erated in academia, where nameless 
accusers can disparage a colleague’s 
science and life’s work, even when 
their claims are unfounded.” Grossman de- 
clined to comment further. 

Former Sabatini postdoc Anne Carpenter, 
now a computational biologist at the Broad 
Institute, noted on Twitter that in early 
April she was asked to sign the anonymous 
letter of support. “I will not sign,” Carpenter 
wrote in a tweet thread on 10 April. She 
wrote that although the MIT investigation 
did not involve her, she was not surprised 
by its findings. 

In his lawsuit, Sabatini had cited 
Carpenter’s successful career as an ex- 
ample that “contradicts a finding that 
women were disadvantaged” in his lab. But 
Carpenter called that logic “appalling” in 
her thread. “My success is not evidence 
that he did not break policies,” she wrote.
“It does not prove that he treated men and
women equally in the lab.” 




SCIENCE AND POLITICS 


Nicaragua’s universities stagger 
under government pressure 


Attacks on higher education autonomy put research— 


and researchers—at risk 


By Sofia Moutinho 


Repression of academia in Nicaragua
has escalated over the past 3 months. 
President Daniel Ortega’s government 
has closed eight private universities 
and confiscated their assets, effec- 
tively ended university autonomy, and 
erected barriers to foreign research collabo- 
rations. “They are killing the universities,” 
says chemist Ernesto Medina, former dean 
of American University, a private institution 
in Managua that remains open. “All these 
measures serve to silence the critical voices 
in academia and suffocate critical thinking.” 
Ortega has been president for 15 years, hav- 
ing won reelections condemned internation- 
ally as farcical. In April 2018, students took to 
the streets in antigovernment protests. The 
government responded with unprecedented 
repression (Science, 21 December 2018, 
p. 1338). Police and paramilitary groups 
killed more than 300 people and arrested 
more than 1000, according to Amnesty Inter- 
national. After writing two open letters ask- 
ing Ortega to stop the “irrational violence,” 
Medina was forced out of his university posi- 
tion. He fled to Germany in July 2020, becom- 
ing one of more than 200,000 Nicaraguans in 
exile. “It is not safe to be there,” he says. 
Since then, conditions for Nicaragua’s 
small scientific community have worsened. 
After the 2018 protests, international re- 
search conferences were canceled and fund- 
ing for scientific institutions, such as the 
Nicaraguan Academy of Sciences, was cut.

Researchers say they must be particularly 
cautious about any work that could be inter- 
preted as political. Research on democracy, 
gender, and human rights has been especially 
hard hit. A social scientist says his group can 
no longer conduct surveys and now pub- 
lishes under pseudonyms if mentioning gov- 
ernment abuses. (Several researchers who 
commented for this article requested ano- 
nymity, fearing retaliation.) Research about 
COVID-19 in Nicaragua also faces obstacles, 
as the government has understated case 
numbers and fired doctors and public health 
experts who express concern about the crisis. 

International collaborations, which had 
helped sustain science in Nicaragua, are 
withering. In February, the government can- 
celed the permits of foreign institutions run- 
ning educational and research programs, 
such as Florida International University and 
Michigan State University. A law passed in 
September 2020 requires any Nicaraguan 
working with international organizations to 
self-identify as a “foreign agent.” But foreign 
agents are considered traitors, researchers 
say. “It practically destroyed any possibility of 
international collaboration,” a biologist says.
U.S. researchers who collaborate with Nicaraguans
declined to comment for this article,
fearing reprisal against their colleagues. 

The government claims the eight private 
universities shut down since February had 
failed to disclose financial details. Sources at
one closed institution, the Polytechnic University
of Nicaragua, believe the real reason
was punishment for students’ involvement in
the 2018 protests.

At the Polytechnic University of Nicaragua in 2018, a memorial honors students killed in protests.

The National Council of Universities 
(CNU), a government body that coordinates 
national policy on higher education, an- 
nounced in February that three new public 
universities would replace the closed ones, of- 
fering enrollment to 20,000 former students. 
But sources say many students have given up 
on academics, fearing persecution. The new 
institutions “reflect the reality of Nicaraguan 
education, submitted to a totalitarian regime 
that has no interest besides keeping the stu- 
dents under control,” Medina says. 

Recent law changes also increased the 
power of CNU, which is now responsible for 
reviewing academic programs, approving ac- 
ademic hires, and selecting deans in all pub- 
lic universities. “It is the end of universities’ 
autonomy,” a former CNU member says.

Some of the 40 or so private universities 
that remain open face economic strangula- 
tion. One particular target is the Central 
American University (UCA), which officially 
opposed the government’s violence toward 
protesters. This month, Ortega’s govern- 
ment canceled a state fund for UCA that 
provided scholarships for 4000 low-income 
students. A scientist who is a professor 
there says the university “is a place of resis- 
tance,” and faculty will keep teaching until 
it is forced to close. 

“We are in a stage of consolidation of totalitarianism
in education,” says former UCA
law professor María Asunción Moreno, who
is now in exile. “It is no exaggeration to say 
there is no university in Nicaragua anymore.” 
In 2019, Moreno led law students and lawyers 
who collaborated with the Inter-American 
Commission on Human Rights to document 
government abuses. In July 2020, she escaped 
Nicaragua after 12 days hiding from arrest. 

Medina estimates more than 2000 Nicara- 
guan students and young professionals, in- 
cluding many scientists, have fled, mainly to 
Costa Rica and Mexico. He hopes universities 
in other countries will create scholarships for 
exiled students. “We have to train people so 
they can think about the changes we’ll want 
when we have democracy again,” he says. 

Molecular biologist Helena Nader, co-chair 
of the Inter-American Network of Academies 
of Sciences, says Nicaragua is an example of 
the growing threats to science and democ- 
racy in Latin America, including in Venezu- 
ela and El Salvador. “What is happening in 
Nicaragua is very serious, and the world is 
silent,” she says.


Sofia Moutinho is a journalist based in Rio de Janeiro. 
A new vaccine’s messenger RNA strand (multicolored) self-amplifies in cells and is translated into the viral spike protein (purple).


First self-copying mRNA vaccine 
proves itself in pandemic trial 


Twist on current vaccines reduces dose, eases distribution 


By Jon Cohen 


A third messenger RNA (mRNA) vaccine
for COVID-19 appears to have proved 
its worth. And although it is more 
than a year behind the Moderna and 
Pfizer-BioNTech vaccines now seen as 
gold standards, the new one may come 
with significant advantages: easier stor- 
age, along with lower cost because its “self- 
amplifying” design allows for smaller doses. 
Arcturus Therapeutics of San Diego, which 
staged a placebo-controlled trial of its candi- 
date in more than 17,000 participants in Viet- 
nam, announced last week in a press release 
that the vaccine had 55% efficacy against 
symptomatic COVID-19 and provided 95% ef- 
ficacy against severe illness and death. “It’s a 
huge accomplishment that for the first time a 
self-amplifying RNA vaccine has been shown 
to be safe and effective,” says Deborah Fuller, 
a vaccinologist at the University of Washing- 
ton School of Medicine who is an adviser to 
HDT Bio, which has its own self-amplifying 
COVID-19 mRNA vaccine in human studies. 
Arcturus’s success may also help make 
mRNA vaccines more broadly accessible. Its 
candidate incorporates a freeze-drying pro- 
cess to transform the mRNA-filled solution 
into a powder that’s hydrated before use. As 
a result, it requires far less refrigeration than
the conventional, liquid mRNA vaccines. 
And Vietnam’s Vinbiocare Biotechnology, 
which collaborated with Arcturus and has 
submitted the trial data to the country’s 
regulators for emergency use authorization, 
hopes to manufacture the product there.

The Pfizer-BioNTech and Moderna vac- 
cines contain mRNA that codes for the SARS- 
CoV-2 spike protein. When the vaccines are 
injected, they deliver the mRNA to cells, 
which make copies of spike and then clear 
the foreign genetic material within a few 
days. Arcturus’s self-amplifying vaccine and 
others in development include enzymes from 
alphaviruses to repeatedly copy the genetic 
strand inside a cell, enabling it to persist in 
the body for more than twice as long. 

Some researchers have cautioned that self- 
amplifying vaccines cannot use an mRNA 
modification that seemed key: the replace- 
ment of the natural RNA building block uri- 
dine with pseudouridine. Studies have shown 
the swap leads to higher levels of the spike 
protein and lower production of immune 
chemicals called cytokines that can cause 
side effects. A conventional mRNA vaccine 
made by CureVac failed in an efficacy trial 
last year, and it didn’t use pseudouridine. 
But Arcturus says its efficacy findings show 
the swap isn’t needed. “It’s a big deal for the 
field,” says Pad Chivukula, the company’s
chief scientific officer. 

The trial gave participants two doses, 
each containing 5 micrograms of the self- 
amplifying mRNA, spaced 28 days apart. The 
Pfizer-BioNTech and Moderna vaccines use 
30-microgram and 100-microgram doses, re- 
spectively, for the first two shots. 

The 55% efficacy of Arcturus’s vaccine 
against symptomatic infection is below the 
90% to 95% seen in trials of the first two
mRNA vaccines. But those two encoded a ver- 
sion of spike that closely matched the SARS- 
CoV-2 variant then circulating. The Arcturus 
candidate encoded the same spike yet had to 
protect against the Delta and Omicron vari- 
ants, which have evolved dramatically from 
the ancestral strain, decreasing the power of 
vaccine-triggered antibodies. 

Fuller says the current real-world effec- 
tiveness of existing mRNA vaccines may be 
in the same ballpark. Of the 43 severe cases 
of COVID-19 recorded by Arcturus during 
the trial, only two were in the vaccinated 
group, and nine of the 10 people who died of 
COVID-19 received the placebo. 
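
The severe-disease figure can be checked against this case split. The sketch below is a rough consistency check, not the trial's statistical analysis: it assumes the vaccine and placebo arms were the same size (the article does not give the randomization ratio), so efficacy reduces to one minus the ratio of case counts. The function name is illustrative.

```python
def vaccine_efficacy(cases_vaccinated: int, cases_placebo: int) -> float:
    """Efficacy = 1 - (attack rate, vaccinated) / (attack rate, placebo).

    Assumes equal-sized arms (an assumption, not stated in the article),
    so the attack-rate ratio equals the ratio of raw case counts.
    """
    if cases_placebo <= 0:
        raise ValueError("need at least one placebo case")
    return 1.0 - cases_vaccinated / cases_placebo


severe_total = 43        # severe cases recorded during the trial
severe_vaccinated = 2    # of which in the vaccinated group
severe_placebo = severe_total - severe_vaccinated

efficacy = vaccine_efficacy(severe_vaccinated, severe_placebo)
print(f"Efficacy against severe illness: {efficacy:.0%}")
```

Under that equal-arms assumption, the 2-versus-41 split works out to roughly 95%, in line with the figure Arcturus announced.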

“These are indeed exciting results,” says
chemist Benjamin Pierce, who is helping run 
a Ugandan trial of a self-amplifying mRNA 
COVID-19 vaccine made by Imperial College 
London. “The low dose used here—six to 20 
times lower than approved RNA vaccines— 
further indicates that self-amplifying RNA 
technology has such potential. I look forward 
to seeing more of the data from the trial.” 

Fuller says a single dose of the self- 
amplifying vaccine could be used for the two 
primary doses, giving it an even clearer ben- 
efit over its conventional relatives. A booster 
later might still be warranted, as is encour- 
aged for the current mRNA vaccines. But 
self-amplifying mRNAs could lead to more 
durable immune responses, Fuller suggests. 

When the Arcturus trial began, less than 
15% of the eligible Vietnamese population 
had received a single shot of a COVID-19 
vaccine. Now, the figure is 80%, which 
raises the question of how the vaccine will 
perform in the vast majority of people who 
have already been vaccinated or naturally 
exposed to SARS-CoV-2. Arcturus hopes to 
soon launch a 2400-person trial to assess its 
value as a booster shot. This trial will aim to 
show the vaccine boosts antibody responses 
that other studies have shown correlate 
with protection—though no new COVID-19 
vaccine has yet received authorization from 
strict U.S. or European regulators based on 
such “immunobridging” data. 

The Pfizer-BioNTech collaboration and 
Moderna have been criticized for not quickly 
sharing their manufacturing skills and intel- 
lectual property with developing countries, 
which have had relatively little access to 
their mRNA vaccines. Arcturus, in contrast, 
has agreed to a technology transfer deal with 
Vinbiocare, which is building a vaccine man- 
ufacturing plant in Hanoi. 

Some observers question whether the Arc- 
turus vaccine is making its debut too late, at 
least for primary vaccination. Chivukula is 
confident it will find a market in countries 
that have far lower vaccination rates than 
Vietnam and stresses it will be at “a price 
point that everyone can afford.”
Math professor’s trial next test of China Initiative


Mingqing Xiao is fighting U.S. charges that he failed to disclose ties to Chinese institutions


By Jeffrey Mervis 


This week, a federal jury in Benton,
Illinois, began hearing the U.S. government’s
evidence against Mingqing
Xiao, a professor of applied math at
Southern Illinois University (SIU),
Carbondale, accused of failing to
disclose his research ties to China while re- 
ceiving a grant from the National Science 
Foundation (NSF). 

Each day, a small cluster of Xiao’s col- 
leagues are attending the proceedings, which 
opened on 25 April, wearing buttons that 
proclaim “We stand with Mingqing Xiao.” It 
is the fourth jury trial of a U.S. aca- 
demic resulting from a controversial 
law enforcement effort known as the 
China Initiative. The outcome will 
be another important test of the U.S. 
government’s hard-edged strategy of 
prosecuting academic scientists—
some two dozen to date—whose ac- 
tions are alleged to have undermined 
national security. 

The China Initiative was launched 
in 2018 to prevent the Chinese govern- 
ment from stealing intellectual prop- 
erty, Department of Justice officials 
said at the time. However, Xiao and 
most of the other prosecuted scientists 
have been accused of financial fraud 
and tax violations, not of improperly 
transferring U.S.-funded research re- 
sults. Most of the cases have been set- 
tled with plea deals or dropped. Three 
jury trials have resulted in two convictions 
and an acquittal. 

Xiao’s background as a Chinese immigrant 
who has built an academic career in the 
United States is typical of many of the defen- 
dants. Born and raised in Guangdong, China, 
Xiao came to the United States after the 1989 
Tiananmen Square uprising, ultimately earn- 
ing his Ph.D. from the University of Illinois, 
Urbana-Champaign. He joined the SIU faculty
in 2000, became a U.S. citizen in 2006,
and a year later was promoted to full profes- 
sor. In 2016, SIU named him an outstanding 
scholar, and in 2020 the university’s public 
radio station gave him its “neighborly” award 
for founding and running a free Saturday 
morning math academy for children. 

In presenting its case this week against 
Xiao, the government was expected to argue 
that the 60-year-old academic broke the law 
when he obtained an NSF grant in 2019 to
develop new mathematical tools for analyz- 
ing high-dimensional data sets. It alleges that 
Xiao had ties to Shenzhen University and 
Guangdong University of Technology that he 
failed to disclose to SIU and NSF, and that 
he received compensation from those entities 
that he didn’t list on his federal tax returns. 
Prosecutors will assert that he did so deliber- 
ately, with the intent to defraud the govern- 
ment. If convicted, he faces up to 20 years in 
prison and a sizable fine. 

Mingqing Xiao went on trial this week in a federal courtroom in Illinois.

Xiao’s lawyer, Ryan Poscablo of Steptoe
& Johnson, will argue there’s insufficient
evidence to prove the allegations and,
moreover, that there was no criminal intent
behind any incorrect statements Xiao
might have made. It is not illegal for U.S. 
academics to receive research funding from 
foreign sources. In fact, until recently most 
U.S. universities encouraged faculty to hook
up with a burgeoning scientific powerhouse 
such as China, and being of Chinese descent 
was seen as an asset. But any ties must be 
disclosed when applying for a federal grant. 
The nature of such disclosures, and whether 
some of those relationships were deliber- 
ately hidden, have emerged as central issues 
in many China Initiative cases. 

Many academics and civil rights groups 
have vehemently criticized the prosecu- 
tions, calling the initiative a witch hunt 
against U.S. scientists of Chinese origin and 
a misguided attack on legitimate research 
collaborations. Two months ago, Assistant
Attorney General Matthew Olsen acknowl- 
edged such criticisms and rebranded the 
initiative as a “strategy to counter nation- 
state threats” (Science, 4 March, p. 945). 

But opponents say they see few signs the 
government has changed course. “From 
where I sit, the demise of the China Ini- 
tiative is greatly exaggerated,” says Peter 
Zeidenberg, an attorney for Franklin Tao, 
a University of Kansas, Lawrence, chemical 
engineer convicted earlier this month in a 
decision that some legal experts think could 
be overturned. 

Xiao was indicted in April 2021, and the 
university immediately put him on paid 
administrative leave and launched 
its own investigation. Xiao’s aca- 
demic colleagues initially refrained 
from making any public statements 
on his behalf. “We didn’t have any 
information beyond what the Justice 
Department had said, and we didn’t 
want to jeopardize his defense,”
says Edward Benyas, an SIU mu- 
sic professor who helped organize 
the daily vigils. Many also feared 
that, as state employees, they might 
face repercussions. 

However, in November 2021 the 
SIU Carbondale Faculty Association 
called on the government to drop the 
charges and asked the university to 
reinstate Xiao and finance his legal 
defense. The university has declined 
further comment on what a spokes- 
person called “a personnel matter.” 

Asian American groups that have spo- 
ken out on behalf of other academics tar- 
geted by the China Initiative acknowledge 
they have done little on Xiao’s behalf. They 
say meager resources and a lack of band- 
width have prevented them from being 
more active. 

Although Xiao has continued to receive 
his university salary, he has been over- 
whelmed with legal bills and has drained 
his life savings. A GoFundMe campaign 
has raised $26,000 toward a goal of 
$350,000, and supporters say the real need 
tops $800,000. 

Benyas and other Xiao supporters don’t 
expect their presence in the courtroom to 
have an impact on the jury’s deliberations. 
“But we think it’s important to show that 
people are supporting him,” he says. “It’s 
the least we can do.”
By harnessing the strange rules of the subatomic realm,
quantum sensors could solve one of the universe's biggest mysteries 


By Adrian Cho

Kent Irwin has a vision: He aims to
build a glorified radio that will reveal
the nature of dark matter, the
invisible stuff that makes up 85%
of all matter. For decades, physicists
have struggled to figure out
what the stuff is, stalking one hypothetical
particle after another,
only to come up empty. However,
if dark matter consists of certain nearly
massless particles, then in the right setting
it might generate faint, unquenchable
radio waves. Irwin, a quantum physicist at
Stanford University, plans to tune in to that
signal in an experiment called Dark Matter
Radio (DM Radio).

No ordinary radio will do. To make the
experiment practical, Irwin’s team plans to
transform it into a quantum sensor—one
that exploits the strange rules of quantum 
mechanics. Quantum sensors are a hot topic, 
having received $1.275 billion in funding 
in the 2018 U.S. National Quantum Initia- 
tive. Some scientists are employing them as 
microscopes and gravimeters. But because 
of the devices’ unparalleled sensitivity, Ir- 
win says, “dark matter is a killer app for 
quantum sensing.” 

DM Radio is just one of many new efforts 
to use quantum sensors to hunt the stuff. 
Some approaches detect the granularity of 
the subatomic realm, in which matter and 
energy come in tiny packets called quanta. 
Others exploit the trade-offs implicit in the
famous Heisenberg uncertainty principle. 
Still others borrow technologies being de- 
veloped for quantum computing. Physicists 
don’t agree on the definition of a quantum 
sensor, and none of the concepts is entirely 
new. “I would argue that quantum sensing
has been happening in one form or another
for a century,” says Peter Abbamonte,
a condensed matter physicist and leader of 
the Center on Quantum Sensing and Quan- 
tum Materials at the University of Illinois, 
Urbana-Champaign (UIUC). 

Still, Yonatan Kahn, a theoretical physi- 
cist at UIUC, says quantum sensors open the 
way to testing new ideas for what dark mat- 
ter might be. “You shouldn’t just go blindly 
looking” for dark matter, Kahn says. “But 
even if your model is made of bubblegum 
and paperclips, if it satisfies all cosmological
constraints, it’s fair game.” Quantum sensing
is essential for testing many of those models,
Irwin says. “It can make it possible to do an
experiment in 3 years that would otherwise
take thousands of years.”

A collision of galaxy clusters separated gas (pink) from dark matter (blue), mapped from subtle gravitational distortions in the images of background galaxies.


ASTROPHYSICAL EVIDENCE for dark matter 
has accreted for decades. For example, the 
stars in spiral galaxies appear to whirl so fast 
that their own gravity shouldn’t be able to keep them
from flying into space. The observation im- 
plies that the stars circulate within a vast 
cloud of dark matter that provides the addi- 
tional gravity needed to rein them in. Physi- 
cists assume it consists of swarms of some 
as-yet-unknown fundamental particle. 

In the 1980s, theorists hypothesized 
what soon became the leading contender: 
weakly interacting massive particles 
(WIMPs). Emerging in the hot soup of particles
after the big bang, WIMPs would interact
with ordinary matter only through
gravity and the weak nuclear force, which 
produces a kind of radioactive decay. Like 
the particles that convey the weak force, 
the W and Z bosons, WIMPs would weigh 
roughly 100 times as much as a proton. 
And just enough WIMPs would naturally 
linger—a few thousand per cubic meter 
near Earth—to account for dark matter. 

Occasionally a WIMP should 
crash into an atomic nucleus 
and blast it out of its atom. So, 
to spot WIMPs, experimenters 
need only look for recoiling 
nuclei in detectors built deep 
underground to protect them 
from extraneous radiation. But 
no signs of WIMPs have ap- 
peared, even as detectors have 
grown bigger and more sensi- 
tive. Fifteen years ago, WIMP 
detectors weighed kilograms; 
now, the biggest contain sev- 
eral tons of frigid liquid xenon. 

The second most popular 
candidate—and one DM Radio targets—is 
the axion. Far lighter than WIMPs, axions 
are predicted by a theory that explains a 
certain symmetry of the strong nuclear 
force, which binds quarks into trios to 
make protons and neutrons. Axions would 
also emerge in the early universe, and 
theorists originally estimated they could 
account for dark matter if the axion has 
a mass between one-quadrillionth and 
100-quadrillionths of a proton. 

In a strong magnetic field, an axion 
should sometimes turn into a radio photon 
whose frequency depends on the axion’s 
mass. To amplify the faint signal, physicists 
place in the field an ultracold cylindrical 
metal cavity designed to resonate with ra- 
dio waves just as an organ pipe rings with 
sound. The Axion Dark Matter Experiment 
(ADMX) at the University of Washington, 
Seattle, scans the low end of the mass range, 
and an experiment called the Haloscope at 
Yale Sensitive to Axion CDM (HAYSTAC) at 
Yale University probes the high end. But no 
axions have shown up yet. 
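
The link between axion mass and radio frequency can be sketched with the relation E = mc² = hf. The snippet below is a back-of-the-envelope estimate, not the experiments' own calculation: it assumes the photon carries the axion's full rest energy and neglects the axion's small kinetic energy. The constants are standard values for the proton rest energy and the Planck constant, and the function name is illustrative.

```python
PROTON_REST_ENERGY_EV = 938.272e6   # proton mass times c^2, in electron volts
PLANCK_EV_S = 4.135667696e-15       # Planck constant, in eV * s


def axion_photon_frequency_hz(mass_in_proton_masses: float) -> float:
    """Frequency of a photon carrying the axion's full rest energy (E = h f)."""
    rest_energy_ev = mass_in_proton_masses * PROTON_REST_ENERGY_EV
    return rest_energy_ev / PLANCK_EV_S


# Original theoretical mass window: one- to 100-quadrillionths of a proton mass.
for label, mass in [("low end", 1e-15), ("high end", 100e-15)]:
    ghz = axion_photon_frequency_hz(mass) / 1e9
    print(f"{label}: {ghz:.2f} GHz")
```

On these assumptions the window runs from roughly 0.2 to 23 gigahertz, which is why the searches rely on radio and microwave cavities.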

In recent years physicists have begun 
to consider other possibilities. Maybe ax- 
ions are either more or less massive than 
previously estimated. Instead of one type 
of particle, dark matter might even consist 
of a hidden “dark sector” of multiple new 
particles that would interact through grav- 
ity but not the three other forces, electro- 
magnetism and the weak and strong nu- 
clear forces. Rather, they would have their 
own forces, says Kathryn Zurek, a theorist 
at the California Institute of Technology. So, 
just as photons convey the electromagnetic 
force, dark photons might convey a dark 
electromagnetic force. Dark and ordinary 
electromagnetism might intertwine so that 
rarely, a dark photon could morph into an 
ordinary one. 

To spot such quarry, dark matter hunters 
have turned to quantum sensors—a shift 
partly inspired by another hot field: quan- 
tum computing. A quantum computer flips 
quantum bits, or qubits, that can be set to 0, 
1, or, thanks to the odd rules 
of quantum mechanics, 0 and 
1 at the same time. That may 
seem irrelevant to hunting 
dark matter, but such qubits 
must be carefully controlled 
and shielded from external 
interference, exactly what 
dark matter hunters already
do with their detectors, says
Aaron Chou, a physicist at 
Fermi National Accelerator 
Laboratory (Fermilab) who 
works on ADMX. “We have 
to keep these devices very, 
very well isolated from the 
environment so that when we see the very, 
very rare event, we're more confident that it 
might be due to the dark matter.” 

The interest in quantum sensors also re- 
flects the tinkerer culture of dark matter 
hunters, says Reina Maruyama, a nuclear 
and particle physicist at Yale and co-leader of 
HAYSTAC. The field has long attracted peo- 
ple interested in developing new detectors 
and in quick, small-scale experiments, she 
says. “This kind of footloose approach has al- 
ways been possible in the dark matter field.” 
FOR SOME NOVEL SEARCHES, the simplest
definition of a quantum sensor may do: 
It’s any device capable of detecting a single 
quantum particle, such as a photon or an 
energetic electron. “I call a quantum sen- 
sor something that can detect single quanta 
in whatever form that takes,” Zurek says. 
That’s what is needed for hunting particles 
slightly lighter than WIMPs and plumbing 
the dark sector, she says. 

Such runty particles wouldn’t produce de- 
tectable nuclear recoils. A wispy dark sector 
particle could interact with ordinary matter 
by emitting a dark photon that morphs into 
an ordinary photon. But that low-energy 
photon would barely nudge a nucleus. 

In the right semiconductor, however, 
the same photon could excite an electron 
and enable it to flow through the material. 
Kahn and Abbamonte are working on an 
extremely sensitive photodiode, a device 
that produces an electrical signal when it 
absorbs light. Were such a device shielded 
from light and other forms of radiation and 
cooled to near absolute zero to reduce noise,
a dark matter signal would stand out as a
steady pitter-pat of tiny electrical pulses.

The trick is to find a semiconductor sensitive
to very low-energy photons, Kahn
says. The industrial standard, silicon, releases
an electron when it absorbs a photon
with an energy of at least 1.1 electron
volts (eV). To detect dark sector particles
with masses as low as 1/100,000th that of
a proton, the material would need to unleash
an electron when pinged by a photon
of just 0.03 eV. So Kahn, Abbamonte, and
colleagues at Los Alamos National Laboratory
are exploring “narrow bandgap”
semiconductors such as a compound of
europium, indium, and antimony.

Even lighter dark-sector particles would
create photons with too little energy to
liberate an electron in the most sensitive
semiconductor. To hunt for them, Zurek
and Matt Pyle, a detector physicist at the
University of California, Berkeley, are developing
a detector that would sense the infinitesimal
quantized vibrations set off when
a dark photon creates an ordinary photon
that pings a nucleus. It would “only rattle
that nucleus and produce a bunch of vibrations,”
Pyle says. “So the detectors must be
fundamentally different.”

Their detector consists of a single crystal
of material composed of two types of ions
with opposite charges, such as gallium arsenide.
The feeble photon spawned by a dark
photon would nudge the different ions in opposite
directions, setting off quantized vibrations
called optical phonons. To detect these
vibrations, Zurek and Pyle dot the crystal
with small patches of tungsten and chill it
to temperatures near absolute zero, where
tungsten becomes a superconductor that
carries electricity without resistance. Any
phonons would slightly warm the tungsten,
reducing its superconductivity and leading
to a noticeable spike in its resistance.


Particles and waves
Quantum detectors include devices that can detect a single quantum, such as a photon, and devices that exploit a quantum trade-off to measure one variable more precisely at the cost of greater uncertainty in another.

Just one click
A dark matter candidate called a dark photon could morph into an ordinary photon that would trigger a quantized vibration in a crystal. The vibration, or phonon, would warm superconducting heat sensors on the crystal.

Quantum trade-off
Within a resonating cavity, a wave of hypothetical axions could transform into faint radio waves, uncertain in both amplitude and phase. Quantum techniques could reduce the uncertainty in the amplitude while increasing that in the wave’s irrelevant phase.

Within 5 years, the researchers hope to 
improve their detector’s sensitivity by a 
factor of 10 so that they can sense a single 
phonon and hunt dark-sector particles 
weighing one-millionth as much as a pro- 
ton. To provide the dark matter, such par- 
ticles would have to be so numerous that 
a detector weighing just a few kilograms 
should be able to spot them or rule them 
out. And because so few experiments have 
probed this mass range, even little proto- 
type detectors unshielded from background 
radiation can yield interesting data, Pyle 
says. “We run just in our lab aboveground, 
and we can get world-leading results.” 


SOME PHYSICISTS ARGUE that true quan- 
tum sensors should do something more 
subtle. The Heisenberg uncertainty prin- 
ciple states that if you simultaneously 
measure the position and momentum of 
an electron, the product of the uncertain- 
ties in those measurements must exceed a 
“standard quantum limit.” That means no 
measurement can yield a perfectly precise 
result, no matter how it’s done. However, 
the principle also implies you can swap 
greater uncertainty in one measurement 
for greater precision in the other. To some 
physicists, a quantum sensor is one that 
exploits that trade-off. 

Physicists are using such schemes to en- 
hance axion searches. To make up dark mat- 
ter, those lightweight particles would be so 
numerous that en masse they’d act like a 
wave, just as sunlight acts more like a light 
wave than a hail of photons. So with their 
metal cavities, ADMX and HAYSTAC re- 
searchers are searching for the conversion 
of an invisible axion wave into a detectable 
radio wave. 

Like any wave, the radio wave will have an 
amplitude that reveals how strong it is and a
phase that marks its exact synchronization 
relative to whatever ultraprecise clock you 
might choose. Conventional radio circuits 
measure both and run into a limit set by the 
uncertainty principle. But axion hunters care 
only about the signal’s amplitude—is a wave 
there or not?—and quantum mechanics lets 
them measure it with greater precision in 
exchange for more uncertainty in the phase. 
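
The trade-off can be illustrated with the textbook model of a squeezed state. This is an assumption made for illustration, not a description of the experiments' actual electronics. In a common convention, the vacuum has variance 1/4 in each of two complementary quadratures, which play the roles of the quantity that is measured and the quantity that is sacrificed. A squeeze parameter `r` shrinks one variance by a factor of e^(-2r) and stretches the other by e^(2r), so their product never changes.

```python
import math

# Variance of each quadrature in the unsqueezed vacuum.
# The value 1/4 is a convention (an assumption for this sketch).
VACUUM_VARIANCE = 0.25


def squeezed_variances(r: float) -> tuple[float, float]:
    """Return (measured, discarded) quadrature variances for squeeze parameter r."""
    measured = VACUUM_VARIANCE * math.exp(-2.0 * r)
    discarded = VACUUM_VARIANCE * math.exp(2.0 * r)
    return measured, discarded


for r in (0.0, 0.5, 1.0):
    measured, discarded = squeezed_variances(r)
    print(f"r={r:.1f}  measured={measured:.4f}  discarded={discarded:.4f}  "
          f"product={measured * discarded:.4f}")
```

The product column stays at 0.0625 for every value of `r`: the precision gained in one variable is paid for in the other.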

HAYSTAC experimenters exploit that 
trade-off to tamp down noise in their ex- 
periment. The vacuum—the backdrop for 
the measurement—can itself be considered 
a wave. Although that vacuum wave has on 
average zero amplitude, its amplitude is still 
uncertain and fluctuates to create noise. In 
HAYSTAC a special amplifier reduces the 
vacuum’s amplitude fluctuations while al- 
lowing those in the irrelevant phase to grow 
bigger, causing any axion signal to stand out 
more readily. Last year, HAYSTAC research- 
ers reported in Nature that they had searched 
for and ruled out axions in a narrow range 
around 19-quadrillionths of a proton mass. By 
squeezing the noise, they increased the speed 
of the search by a factor of 2, Maruyama says, 
and validated the principle. 

Such “squeezing” has been demonstrated 
for decades in laboratory experiments with 
lasers and optics. Now, Irwin says, “These 
techniques for beating the standard quan- 
tum limit [have] been used to actually do 
something better, as opposed to do some- 
thing in a demonstration.” In the DM Radio 
experiment, he hopes to use a related tech- 
nique to probe for even lighter axions as well 
as dark photons. 
Instead of a resonating cavity, DM Radio
consists of a radio circuit containing
a charge-storing capacitor and a current- 
storing inductor—a carefully designed coil 
of wire—both placed in a magnetic field. 
Axions could convert to radio waves within 
the inductor coil to create a resonating sig- 
nal in the circuit at a certain frequency. Re- 
searchers can also look for dark photons by 
reconfiguring the coil and turning off the 
magnetic field. 

To read out the signal, Irwin’s scheme plays 
on another implication of quantum mechan- 
ics, that by measuring a system’s state you 
may change it. The researchers couple 
their resonating circuit to a second, 
higher frequency circuit, so that, much 
as in AM radio, any dark matter signal 
would make the amplitude of the higher 
frequency carrier wave warble. The 
stronger the coupling, the bigger the 
warbling, and the more prominent the 
signal. But stronger coupling also injects 
noise that could stymie efforts to mea- 
sure dark matter with greater precision. 

Again, a quantum trade-off comes 
to the rescue. The researchers modify 
their carrier wave by injecting a tiny 
warble at the frequency they hope to 
probe. Just by random chance, that in- 
put warble and any dark matter signal 
will likely be somewhat out of sync, or 
phase. But the dark matter wave can 
be thought of as the sum of two components:
one that’s exactly in sync with
the added signal and one that’s exactly
out of sync with it, much as any direction
on a map is a combination of
north-south and east-west. The experiment
is designed to measure the in-sync
component with greater precision
while injecting all the disturbance into 
the out-of-sync component, making the 
measurement more sensitive and ac- 
celerating the rate at which the experi- 
ment can scan different frequencies. 
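
The split into in-sync and out-of-sync parts is ordinary trigonometry, and a minimal Python sketch may make it concrete. It assumes an idealized, noise-free sinusoid sampled over one full period; the function name `quadratures` and its parameters are illustrative and are not taken from the DM Radio experiment, and nothing here models the quantum noise trade-off itself.

```python
import math


def quadratures(amplitude, phase, n_samples=1000):
    """Project amplitude*cos(t + phase) onto two reference waves.

    Returns (in_sync, out_of_sync): the components along cos(t) and
    along -sin(t), the reference shifted by a quarter cycle.
    All names are hypothetical and chosen for illustration only.
    """
    in_sync = 0.0
    out_of_sync = 0.0
    for k in range(n_samples):
        t = 2.0 * math.pi * k / n_samples  # sample one full period
        signal = amplitude * math.cos(t + phase)
        in_sync += signal * math.cos(t)
        out_of_sync += signal * -math.sin(t)
    scale = 2.0 / n_samples  # normalizes the sum of cos^2 over a period
    return in_sync * scale, out_of_sync * scale


# A wave of amplitude 2 that leads the reference by 60 degrees.
i_part, q_part = quadratures(2.0, math.pi / 3)
print(i_part, q_part, math.hypot(i_part, q_part))
```

The two components behave like the north-south and east-west legs of a map direction: together they recover the full amplitude, but each can be considered on its own.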

Irwin and colleagues have already 
run a small prototype of the experiment.
They are now building a larger version, 
and ultimately they plan one with a coil that 
has a volume of 1 cubic meter. Implementing 
the quantum sensing is essential, Irwin says, 
as without it, scanning the entire frequency 
range would take thousands of years. 


SOME DARK MATTER HUNTERS are explicitly 
borrowing hardware from quantum com- 
puting. For example, Fermilab’s Chou and 
colleagues have used a superconducting 
qubit—the same kind Google and IBM use 
in their quantum computers—to perform 
a proof-of-principle search for dark pho- 
tons in a very narrow energy range. Like 
a smaller version of ADMX or HAYSTAC, 
their experiment centers on a resonating
cavity, this one drilled into the edge of 
an aluminum plate. There a dark photon 
could convert into radio waves, although at 
a higher frequency than in ADMX or HAYSTAC.
Ordinarily, experimenters would
bleed the radio waves out through a hole 
in the cavity and measure them with a low- 
noise amplifier. However, the tiny cavity 
would generate a signal so faint it would 
drown in noise from the amplifier itself. 

A chip that could sense dark photons (top) and an axion detector,
HAYSTAC, could fit on a tabletop despite their high sensitivity.

The qubit sidesteps that problem. Like
any other qubit, the tiny superconducting
circuit can act like a clock, cycling between
different combinations of 0 and 1 at a rate
that depends on the difference in energy 
between the circuit’s 0 and 1 states. That 
difference in turn depends on whether 
there are any radio photons in the cavity. 
Even one is enough to speed up the clock, 
Chou says. “We’re going to stick this artifi- 
cial atomic clock in the cavity and see if it 
still keeps good time.” 

The measurement probes only the am- 
plitude of the radio waves and not their 
phase, obtaining greater precision in the 
former in exchange for greater uncertainty 
in the latter, the team reported last year in 
Physical Review Letters. It might speed up
dark photon searches by as much as a factor
of 1300, Chou says, and it could be extended
to search for axions, if researchers
could apply a magnetic field to the cavity 
while shielding the sensitive qubit. 

One group has invented a scheme to 
search for WIMPs using another candidate 
qubit: a so-called nitrogen vacancy (NV) 
center within a diamond crystal. In an NV, 
a nitrogen atom replaces a carbon atom in 
the crystal lattice and creates an adjacent, 
empty site that collects a pair of electrons 
that can serve as a qubit. A WIMP passing
through a diamond can bump carbon at- 
oms out of the way, leaving a trail of 
NVs roughly 100 nanometers long, says 
Ronald Walsworth, an experimental 
physicist at the University of Mary- 
land, College Park. The NVs will absorb 
and emit light of specific wavelengths, 
so the track can be spotted clearly with 
fluorescence microscopy. 

That scheme has little to do with 
quantum computing, but it would ad- 
dress a looming problem for WIMP 
searches. If current liquid xenon detec- 
tors get much bigger, they should start 
to see well-known particles called neu- 
trinos, which stream from the Sun. To 
tell a WIMP from a neutrino, physicists 
would need to know where a particle 
came from, as WIMPs should come 
from the plane of the Galaxy rather 
than the Sun. A liquid xenon detector 
can’t determine the direction of a par- 
ticle that caused a signal. A detector 
made of diamonds could. 

Walsworth envisions a detector 
formed of millions of millimeter-size 
synthetic diamonds. A diamond would 
flash when pierced by a neutrino or 
WIMP, and an automated system would 
remove it and scan it for an NV track, 
using the time of the flash to determine 
the track’s orientation relative to the 
Sun and the Galaxy, the team explained 
last year in Quantum Science and Tech- 
nology. Walsworth hopes to build a 
prototype detector in a few years. “I ab- 
solutely do not want to claim that our idea 
would work or that it’s better than other ap- 
proaches,” he says. “But I think it’s promising 
enough to go forward.” 

Physicists have proposed many other 
ideas for using quantum sensors to search 
for dark matter, and the influx of money 
should help transform them into new tech- 
nologies, Zurek says. “Things can move 
faster when you're funded,” she says. As 
tool builders, dark matter hunters embrace 
that push. “They have a great hammer, so 
they started looking for nails,” Walsworth 
says. Perhaps they’ll bang out a discovery 
of cosmic proportions. 

A stark future for ocean life

Model predicts a mass extinction event in the oceans if climate change is uncurbed


By Malin L. Pinsky and Alexa Fredston 


The year 2021 marked the highest temperature
and likely the lowest oxygen
content for the oceans since human
records began (1, 2). These changes
have put marine species on the front 
lines of climate change. For example, 
marine species’ geographical ranges are 
shifting faster and experiencing more con- 
tractions than those of terrestrial species (3, 
4). However, whether climate change poses 
an existential threat to ocean life has been 
less clear. Marine species are often consid- 
ered to be more resilient to extinction than 
terrestrial ones, and human-caused global 
extinctions of marine species have been 
relatively rare (5). On page 524 of this is- 
sue, Penn and Deutsch (6) present extensive 
modeling to reveal that runaway climate
change would put ocean life on track for a 
mass extinction rivaling the worst in Earth’s 
history. Furthermore, they reveal how keep- 
ing global warming below an increase of 
2°C compared with preindustrial levels 
could largely prevent these outcomes. 

The topic of climate change and species 
extinction on land has been fraught with 
controversy. This is in part because of de- 
bates over suitable methods of predicting 
extinctions and in part because of the rela- 
tively few documented extinctions to date 
(7). The marine research community has 
largely avoided making projections of ex- 
tinction risk (8), even though experts widely 
see climate change as a major threat to the 
global oceans (9). This has left the watery 
70% of Earth’s surface as a giant blank spot 
in the future projection of life on Earth. 

Penn and Deutsch modeled suitable habi- 
tats for marine species on the basis of well- 
described physiological processes that link
metabolic demand for oxygen to the supply
of oxygen to organismal tissues as a func- 
tion of temperature. As warming causes the 
demand for oxygen to exceed supply in a 
given location, survival likely becomes un- 
tenable, causing extinctions. The authors 
calibrated their model against the oceano- 
graphic changes they reconstructed for the 
end-Permian mass extinction event, which 
was a period of extensive warming and de- 
oxygenation 250 million years ago, colloqui- 
ally known as the Great Dying. Although 
not a perfect analog to the current climate 
situation, the end-Permian mass extinction 
is one of the most cataclysmic periods in 
Earth’s history for which there are records 
of extensive warming and extinction. Penn 
and Deutsch built confidence in their physi- 
ological model the same way climate mod- 
elers do—by integrating well-understood 
mechanisms and by demonstrating its abil- 
ity to explain historical patterns. 
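
The supply-versus-demand logic can be illustrated with a toy calculation. The sketch below is not Penn and Deutsch's model: it assumes, purely for illustration, that oxygen demand doubles with every 10°C of warming (a textbook Q10-style rule) and that a location is habitable only while supply meets demand. The function names, units, and default values are hypothetical.

```python
def oxygen_demand(temp_c, base_demand=1.0, q10=2.0, ref_temp_c=10.0):
    """Toy metabolic oxygen demand in arbitrary units.

    Assumption (not from the article): demand is multiplied by q10
    for every 10 degrees Celsius of warming above ref_temp_c.
    """
    return base_demand * q10 ** ((temp_c - ref_temp_c) / 10.0)


def is_habitable(temp_c, oxygen_supply, **demand_params):
    """A location is treated as habitable while supply meets demand."""
    return oxygen_supply >= oxygen_demand(temp_c, **demand_params)


# With a fixed supply of 1.5 units, warming alone removes the habitat.
for temp in (10.0, 15.0, 20.0):
    print(temp, oxygen_demand(temp), is_habitable(temp, oxygen_supply=1.5))
```

In this toy setting the same water becomes uninhabitable through warming alone, and any loss of oxygen supply would move the threshold to a lower temperature.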

The model predicts several key patterns in 
future extinction risk. For instance, the tropi- 
cal oceans are expected to lose the most spe- 
cies as temperature and oxygen conditions 
exceed the ranges tolerable for most extant 
species today. However, many tropical spe- 
cies are expected to migrate and survive at 
higher latitudes as the global oceans warm 
up. This process mirrors the widespread 
range shifts that have already been observed
on land and in the oceans (3). However, polar
species are likely to go globally extinct as
their suitable oceanographic conditions disappear
entirely. Climate change is, in effect,
walking species off the ends of the Earth.
Widespread warming and deoxygenation in
the end-Permian mass extinction had a similar
geographic signature (10).

Accelerating ocean warming and deoxygenation
threaten a mass extinction rivalling the worst
in Earth’s history, especially for cold-water species,
such as the Atlantic rock crab shown here.

The alternative scenarios explored by 
Penn and Deutsch illustrate the key choice 
that society is facing. Unabated greenhouse 
gas emissions would push radiative heating 
on Earth to 8.5 W/m² by 2100 and beyond
in the centuries after. This future produces 
a doomsday scenario of end-Permian-like 
mass extinction by the year 2300. An alter- 
native, consistent with the Paris Agreement 
(the 2016 international treaty on climate), 
is to limit or even reverse greenhouse gas 
emissions and keep radiative forcing to 2.6 
W/m² by 2100. Doing so would keep climate
change as a relatively minor threat to ma- 
rine species’ existence—lower than other 
more-direct threats, such as habitat loss or 
fishing—although the widespread extirpa- 
tion of species from their historical loca- 
tions will remain inevitable. Fortunately, 
greenhouse gas emissions are not on track 
for the worst-case scenario given policies 
to limit greenhouse gas emissions and the 
slower-than-projected growth of global 
economies (11). How close to the best-case
scenario human society can hew, however, 
remains one of the most pressing questions 
for the future of life in the oceans. 

The model of Penn and Deutsch also il- 
luminates key scientific uncertainties that 
surround the fate of marine life, including 
the degree to which species can expand 
into new territory as it becomes climatically 
suitable. Observations of rapidly expand- 
ing range edges make it clear that a pessi- 
mistic, no-dispersal scenario is unrealistic, 
but neither is it likely that all species will 
perfectly track suitable habitats in an opti- 
mistic, perfect-dispersal scenario (12). The 
transient dynamics of species expansion and 
contraction (13) will likely determine which 
species survive in the coming decades. 
Another key uncertainty is the percentage of 
habitat that species can lose before going ex- 
tinct. Calibrating against end-Permian data, 
Penn and Deutsch predict that loss to be 50 
to 70%. Although broadly consistent with 
the criteria published by the International 
Union for Conservation of Nature (14), very
few data exist for marine species on which to 
calibrate these values because most human- 
caused marine extinctions have been driven
by overexploitation, not climate change (15).
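
The two dispersal extremes and the habitat-loss threshold can be sketched together in a few lines. This is a minimal illustration with made-up grid cells, not the authors' code: the function names are hypothetical, and the 0.5 threshold is simply the lower end of the 50 to 70% range quoted above.

```python
def habitat_loss(now, future, dispersal=False):
    """Fraction of present-day habitat lost, from two lists of booleans.

    now[i] and future[i] say whether grid cell i is habitable today and
    in the projected climate. With dispersal=False (no-dispersal
    scenario) only cells occupied today can count in the future; with
    dispersal=True (perfect-dispersal scenario) every future habitable
    cell counts.
    """
    original = sum(now)
    if original == 0:
        raise ValueError("species has no habitat today")
    if dispersal:
        remaining = sum(future)
    else:
        remaining = sum(1 for n, f in zip(now, future) if n and f)
    return max(0.0, 1.0 - remaining / original)


def goes_extinct(loss, threshold=0.5):
    """Assumed rule: extinction once habitat loss reaches the threshold."""
    return loss >= threshold


# Made-up example: suitable habitat shifts poleward by three cells.
now = [True, True, True, True, False, False]
future = [False, False, False, True, True, True]
for dispersal in (False, True):
    loss = habitat_loss(now, future, dispersal)
    print(dispersal, loss, goes_extinct(loss))
```

With these illustrative cells the species loses 75% of its habitat if it cannot move and 25% if it tracks the shift perfectly, which is why the dispersal assumption matters so much to the projected outcome.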

The research of Penn and Deutsch also 
highlights key areas in need of investigation, 
such as metabolic traits for more marine 
species. Their modeling projects the future 
for all marine life using information from 
only a few dozen species, and future research 
will be needed to understand trait variation 
and correlation across life history strategies, 
habitats, and phylogenies. Other important 
extensions will include a better understand- 
ing of the role that microclimates and habi- 
tat refugia play in species survival, the po- 
tential for evolution of metabolic limits, the 
role of changing species interactions in pro- 
moting or impeding species survival, and the 
oceanographic dynamics in shelf and other 
coastal habitats that are currently not well 
represented by global climate simulations. 
Improved projections of these processes will 
refine the forecasts for marine biodiversity 
under climate change. 

Not too long ago, canaries warned coal 
miners of toxic gas accumulation. Today, 
marine life is warning the world of a differ- 
ent and global gas accumulation. Staving off 
widespread biodiversity loss and the sixth 
mass extinction is a global priority. Because 
marine extinctions have not progressed as 
far as those on land, society has time to turn 
the tide in favor of ocean life. Exactly where 
the future falls between the best-case and 
worst-case scenarios will be determined 
by the choices that society makes not only 
about climate change, but also about habi- 
tat destruction, overfishing, and coastal 
pollution. With a coordinated approach 
that tackles multiple threats, ocean life as 
we know it has the best chance of surviving 
this century and far beyond. 

INFECTIOUS DISEASES

From outbreaks to endemic disease

Relief from the effects of epidemics may signal the start of low-level disease persistence


By Michael F. Antolin 


In epidemiology, endemism describes a
state in which pathogens persist at low 
levels even if they cause high mortality 
and morbidity during outbreaks. Ende- 
mism occurs when disease falls to a low 
and possibly tolerable (treatable) level. 
But persistence at low prevalence presents 
a health challenge, especially for virulent 
pathogens that may spill over to humans 
and companion animals, spill back to other 
wildlife species, or reemerge again in epi- 
demics. On page 512 of this issue, Mancy et 
al. (1) present a longitudinal study (from 
2002 to 2016) of canine rabies in the Seren- 
geti region of northern Tanzania. This study 
addressed how the rabies virus continues 
to circulate in domestic dogs at less than 
0.15% prevalence while showing (weak) pat- 
terns of density dependence (more rabies at 
higher population densities of dogs). This 
study highlights the continued health risks 
posed by pathogens that become endemic. 
The dataset reported by Mancy et al. in- 
cludes observations of tens of thousands of 
domestic dogs; contact tracing of infections 
from rabid dogs to other dogs and humans; 
and detailed tracking of dog densities, dis- 
ease incidence, and vaccination campaigns. 
By implementing a complex individual- 
based model using the spatially detailed 
data, the authors determine that endemic 
persistence of Rabies lyssavirus depends on 
identifying the spatial scale (1 km²) at which
relatively small outbreaks begin by arrival of 
rabid dogs from elsewhere. A critical feature 
is that transmission occurs through super- 
spreading, in which a few rabid dogs travel 
relatively long distances and are responsible 
for most of the infectious bites. Rabies outbreaks
in Tanzania resolved into 22 dominant
transmission chains that arose in different
localities at different times.
A previous study from the same 
area sequenced R. lyssavirus 
isolates from a subset of cases 
and placed them in a phylogeo- 
graphic framework that veri- 
fied the existence of genetically 
identifiable lineages (2). Similar 
rabies transmission chains have 
been identified in the Central Af- 
rican Republic (3). 

Canine rabies provides a clear 
example of a virulent pathogen, 
which is inevitably fatal because 
dogs do not naturally develop 
immunity, endemically persist- 
ing because transmission and 
infection are spatially segregated 
into a metapopulation (4, 5). In 
a rabies metapopulation, local 
flare-ups in places with denser 
dog populations burn themselves 
out, but rabid dogs disperse 
while transmitting the virus and 
starting new local outbreaks. 
With rabies prevalence being 
low across the region of north- 
ern Tanzania, rabies infections 
cannot at that scale be predicted 
from dog densities. Disaggregat- 
ing the region into individual 
lineages at the scale of 1 km² reveals locally
subdivided and temporally separated infec- 
tion dynamics. Furthermore, dynamics are 
density-dependent at the local scale, with ra- 
bies persisting in the region asynchronously 
and endemically as a metapopulation with 
multiple independent chains of transmission. 

Localized contagion and outbreaks of dis- 
ease are not new to the human experience. 
This was observed in Medieval Europe dur- 
ing the Black Death (plague) era that be- 
gan in 1347 AD and caused local outbreaks 
for three centuries. The advice “Fuge cito, 
vade longe, rede tarde” (flee quickly, go far,
return slowly) was commonly given when 
outbreaks flared up. Plague (Yersinia pestis) 
is a zoonotic, flea-borne bacterial pathogen 
of ground-dwelling rodents that catastrophi- 
cally spilled over to humans in at least three 
large-scale epidemics dating back to 550 AD. 
Plague does not persist as an epidemic hu- 
man disease and now recurs only in places 
where competent rodent hosts and their 
fleas are uncontrolled in human proximity 
(6). Persistence in rodent host populations, 
however, depends on metapopulation dy- 
namics (7-9). For example, plague outbreaks 
among prairie dog colonies on the Great 
Plains of the United States sporadically cause 
local extinctions of prairie dog towns, which 
are then recolonized ~4 to 5 years after the 
epidemic wanes (8). Plague retains high virulence
and, in the time since its introduction
into North America in the early 1900s, has
drastically reduced overall prairie dog abundance.
Despite high local mortality, plague
has not caused overall extinction of prairie
dogs from the grasslands. Like rabies, plague
persists endemically and is usually undetectable
between outbreaks in prairie dog towns.

Canine rabies was tracked in tens of thousands of domestic
dogs in northern Tanzania to understand the persistence of
endemic disease.

Does rabies endemism in dogs in Tanzania 
represent a general phenomenon of patho- 
gen persistence? Rabies in Tanzania is in 
many ways a distinct example, with trans- 
mission directly from dog to dog, no wild an- 
imal reservoir, a long serial interval between 
infections (up to several hundred days), and 
infections that are nonimmunizing and in- 
evitably fatal. Other pathogens have a diver- 
sity of transmission modes (directly among 
hosts, vector-borne, or from animal or envi- 
ronmental reservoirs), differences in serial 
infection intervals (time from transmission 
to a host until the onset of symptoms), and 
heterogeneity of transmission patterns (su- 
perspreaders) and can elicit variable im- 
mune responses. All of these factors can af- 
fect persistence and endemism. Further, in 
host-pathogen systems in the wild, observing 
infection dynamics through contact tracing, 
accurately estimating host abundance, eval- 
uating host immunity, and detecting move- 
ments of infected hosts without bias is much 
more difficult (10). 

What is generalizable is that pathogen 
transmission is scale-dependent and that
most host populations are subdivided
and are not well mixed
at large scales. Ecological in- 
teractions, including pathogen 
transmission patterns, gener- 
ally occur locally (10-12), and 
epidemiological studies that 
aggregate data to large spatial 
scales can result in spurious 
conclusions (13). Mancy et al.
tested alternative spatial scales 
in their model, with the local 
scale of 1 km² providing the best
fit to the rabies prevalence data. 
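
A small arithmetic sketch shows why aggregation can mislead. The numbers below are invented for illustration and are not from Mancy et al.; each tuple stands for one hypothetical 1 km² cell as (number of dogs, number of rabid dogs), and the function names are likewise assumptions.

```python
# Invented example data: (dogs, rabid dogs) in four hypothetical cells.
EXAMPLE_CELLS = [(200, 3), (50, 0), (400, 0), (350, 0)]


def prevalence(cases, dogs):
    """Fraction of dogs that are infected."""
    if dogs <= 0:
        raise ValueError("need at least one dog")
    return cases / dogs


def regional_prevalence(cells):
    """Prevalence after pooling every cell into one region."""
    total_dogs = sum(dogs for dogs, _ in cells)
    total_cases = sum(cases for _, cases in cells)
    return prevalence(total_cases, total_dogs)


def local_prevalences(cells):
    """Prevalence computed separately for each cell."""
    return [prevalence(cases, dogs) for dogs, cases in cells]


print(regional_prevalence(EXAMPLE_CELLS))
print(local_prevalences(EXAMPLE_CELLS))
```

Pooled across the region, the outbreak is diluted to a fraction of a percent, while the single affected cell shows a prevalence several times higher. Any relationship between local dog density and local infection is invisible in the pooled figure.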

After 2 years of the COVID-19
pandemic, a transition
to endemism and the
end of widespread infection 
chains and less obtrusive pub- 
lic health measures may be in 
reach. Widespread vaccination 
should lessen disease and slow 
emergence of new viral vari- 
ants. The likelihood, however, 
is that severe acute respiratory 
syndrome coronavirus 2 (SARS- 
CoV-2) will not entirely burn 
out but will slide into endemic 
persistence in localized chains 
of transmission. But endemic 
persistence still carries health 
risks in local populations. In Tanzania dur- 
ing the 2002 to 2016 period, more than 1000 
humans were bitten by rabid dogs, with 44 
rabies deaths (1). Vaccination of dogs has
been implemented but not at high enough 
levels to stop transmission and eradicate 
R. lyssavirus. Even in the modern vaccina- 
tion era, several early-childhood diseases 
(measles, mumps, rubella, and whooping 
cough) and the dengue virus persist in lo- 
cal transmission chains that escape detec- 
tion without regular surveillance (14, 15). 
Although endemism is more desirable than 
an epidemic, it is not the end of disease but 
a new and alternative challenge.

BATTERIES


A granular approach to electrode design 


The consistency of cathode particles plays a pivotal role in battery performance 


By Jie Xiao 


Tailoring the intrinsic properties of
battery materials and designing
their multiscale structures are very
important to maximize their electrochemical
performance. When
electrochemically active materials,
often in powder form, are used in
batteries, they are usually dispersed within
porous composite electrodes immersed in 
a liquid electrolyte and provide ionic and 
electronic conduction pathways for the 
electrochemical reactions. Therefore, the 
full utilization of electrochemically active 
particles in the electrodes (i.e., cathodes 
and anodes) depends not only on their 
own properties but also on those of the 
conductive network for the particles, as 
well as the interactions between the par- 
ticles and the network. On page 517 of this 
issue, Li et al. (1) report on the dynamics
of particle networks in composite cathodes
for lithium-ion batteries and highlight the
importance of size consistency for the elec- 
trode particles. 

The functionality of an electrode de- 
pends on the integration of the respective 
contributions from every active particle in 
the electrode network. A typical battery 
electrode consists of electrochemically ac- 
tive particles embedded in a porous com- 
posite whose porosity can be tailored for 
specific applications (2). Ideally, the electrolyte
infiltrates into every pore of the
electrode to ensure good ionic conductiv- 
ity. In the perfect scenario, ions are ac- 
cessible everywhere, and the conductivity 
within the electrodes is good because of 
the interconnected carbon in the electrode 
structure (3). However, in reality, the lo- 
cal conductive network surrounding the 
active cathode particles, which facilitates 
the interactions of ions and electrons for 
the redox reactions, is not homogeneous, 
and thus the reaction rate of the active par- 
ticles is also not homogeneous. 

The electrode heterogeneity creates a 
dynamic evolution of the electrochemical 
activities of individual particles inside the 
electrode, which affects battery perfor- 
mance. The heterogeneous and dynamic 
electrochemical activities of electrode par- 
ticles cannot be measured by the electro- 
chemical output, such as the voltage or the 
battery capacity, because these bulk mea- 
surements do not give information about 
the movement and interactions among the 
individual particles. During charge-dis- 
charge cycles, when lithium ions go in and 
out of the host particles in the cathode, a 
dynamic equilibrium of the particle system 
is reached through the coevolution of all 
particles driven by spontaneously yielded 
internal pressure. Tracking the interac- 
tions of individual active particles within 
the electrode is needed to understand the 
dynamic equilibrium within the electrode. 

Li et al. investigated the dynamic evolution
of the particles in a battery cathode
(see the figure). Using synchrotron
nanoresolution holo-tomography, they 
tracked the morphological and chemical 
characteristics of thousands of individual 
active particles. Inside the cathode, the 
morphological defects and the electro- 
chemical activities of individual particles 
coevolved and transitioned through three 
distinct stages: single-particle activation, 
where electrode particles begin partici- 
pating in the electrochemical reactions; 
followed by segregation of groups of par- 
ticles, where the utilization of particles 
and degree of damage to the particles 
vary; and finally, global homogenization, 
where the modulation between the electro- 
chemical activity and mechanical damage 
reduces those variations. The authors also 
observed a correlation between the collec- 
tive electrochemical activity of, and the 
damage to, the particles that occur during 
charge-discharge cycles. According to their 
model, particles with higher electrochemi- 
cal activity are more likely to experience 
mechanical damage, such as particle frac- 
ture or detachment from the electrodes. 
This mechanical damage in return would 
suppress the electrochemical reaction by 
increasing the resistance. 

The findings of Li et al. highlight the 
need for high-throughput analysis and the 
importance of morphological and chemi- 
cal consistencies of electrochemically active 
particles for electrode robustness against 
charge-discharge cycles. The relevance of
particle homogeneity for battery materials
has been well realized in commercial batteries.
In some cases, a mixture of particles
with different sizes is purposely packed into
electrodes to increase the electrode density,
provided that the particles are smaller
than the critical size beyond which cracking
would occur (4, 5). In academic research,
however, there are often considerable variations
in the size, shape, and surface properties
of the same materials being used by
industry, which can lead to inconsistent
findings. Li et al. highlight the value of selecting
materials with particle-level consistency
for application-oriented research.

The evolution of electrochemically active particles in composite cathodes

Each active cathode particle evolves and shifts inside composite cathodes differently during charge-discharge cycles. This is caused by the heterogeneity of the
conductive structure (shown here as carbon) and various inconsistencies among the particles, such as in size and degree of cracking. Over time, the particles settle into
clusters and stabilize the electrochemical property of the battery.

Particles with more points of contact with
the conductive carbon have more cracking
and a higher capacity contribution.

Particles with fewer points of contact with
the conductive carbon have less cracking
and a lower capacity contribution.

Conductive carbon network

Particles with a higher capacity
contribution clump together, affecting
the global homogenization transition.

The contact between active particles and 
carbon, as well as the material consistency 
of the porous electrode itself, affect the ca- 
pacity and cycle life of batteries. Research 
efforts using field-guided self-assembly (6), 
freeze-drying (7), or printing technologies 
(8) to gauge the electrode-level structure 
ordering are possible solutions if the po- 
rosity of electrodes can be better controlled 
and monitored. The electrochemical per- 
formance of a battery is determined by the 
electrode that’s the weakest link. When as- 
sessing cathode (or anode) materials, the 
corresponding anode (or cathode) must 
also be sufficiently stable that the observed 
battery performance is not muddled by un- 
wanted discrepancies. 

In the future, it would be of great in- 
terest to develop an integrated approach 
for studying all the different components 
of a battery. For example, a carbon addi- 
tive in positive electrodes is usually only 
about 2% by weight but establishes the en- 
tire conductive carbon framework for elec- 
trodes (3). Similarly, the distribution of the 
very limited amount of electrolyte in a bat- 
tery plays an equally outsized role in bat- 
tery performance, specifically, in how ions 
move within the electrodes. A fundamental 
understanding and large-set data analysis 
of the dynamics of the particulate system 
in battery electrodes can help guide more 
refined battery materials and electrode 
manufacturing in the future, which will 
only become more relevant as the world 
gradually moves away from fossil fuel de- 
pendency. 

Honey bees initiating dance language experience an increased concentration of dopamine, indicating the
presence of a wanting system in insects that is analogous to that observed in mammals, including primates. 


Why do animals want what they like?


As in mammals, honey bee motivation for 
wanting rewards is modulated by dopamine 


By Jair E. Garcia and Adrian G. Dyer


The ability to experience pleasure or
displeasure is created by the brain and
therefore has a physiological component
(1). In 1872, based on his observations
of humans and animals, Charles
Darwin proposed that evolution may
select affective reactions that shape behav- 
ior, and thus individual fitness (2). Affective 
neuroscience in mammals searches for 
mechanistic explanations that underpin the 
experience of liking or disgust (1). Recently,
the role of dopamine was reevaluated, show- 
ing that it is an important neuromodulator 
for wanting rather than liking rewards (3). 
On page 508 of this issue, Huang et al. (4) 
show that regulation of the neurotransmitter 
dopamine is also an important component 
of the motivation of wanting in honey bees 
(Apis mellifera). This suggests that the fit- 
ness benefits of a motivation wanting system 
regulated by dopamine are likely to be conserved
and may explain behavior in a wide
range of animals. 

In humans and other mammals, analo- 
gous brain circuitry is activated by the 
wanting or desire for a reward (1). For ex- 
ample, different reward types such as food, 
sex, addictive drugs, and even various forms 
of experienced artwork show evidence of 
stimulating a brain network that includes 
the prefrontal cortex, ventral pallidum, and 
amygdala of the primate brain in a similar 
manner. Additionally, common mammalian 
brain structures, such as the amygdala in 
rodents (3), also show activation to wanting 
a reward (5). This suggests a potential com- 
mon reward network of interacting brain re- 
gions that underpin rewarding experiences. 
An important component of such a reward 
system is the regulation of dopamine, which 
helps modulate incentive motivation, the 
wanting for certain rewards. 

Honey bees are an important comparative 
species for how brains learn, and together 
with other insect models, such as the com- 
mon fruit fly (Drosophila melanogaster), are 
a test bed to investigate what might be the 
minimum brain structures required for information
processing (6). Honey bees are adept
learners of a variety of visual and olfactory 
stimuli when associated with a sweet-tasting 
sugary reward. Individual bees are then moti- 
vated to communicate the value of their per- 
ceived experience to hive mates through the 
bee waggle dance language (7). This behav- 
ioral paradigm enabled decoding of the bee 
dance language that contains information 
about direction and distance of profitable 
food sources and enables good experimental 
access for testing bee sensory perception (8). 

Huang et al. investigated how honey bees
forage, and what might motivate individuals 
to leave the hive in search of sweet-tasting re- 
wards in the environment. By monitoring the 
foraging and subsequent dance language by 
bees from different test groups that were cap- 
tured at different phases of their activities, it 
was possible to understand the potential role 
of the dopaminergic system in the anticipa- 
tion of wanting a reward. When bees that 
had recently visited a profitable food source 
started dancing to communicate to other 
bees in the hive, the dancing bees showed 
a significant increase in dopamine brain 
concentrations, which was quantified using 
high-performance liquid chromatography on 
single brains. However, this increase is tem- 
porary and drops by the end of the bee’s wag- 
gle dance. When dancing bees subsequently 
depart the hive to seek more rewards, dopa- 
mine again increases, followed by a decrease 
while feeding. These fast transient changes 
of dopamine concentrations in the bee brain 
show that it is the expectation of wanting a 
reward that drives this important compo- 
nent of how motivation is modulated in the 
reward system—a mechanism similar to that 
reported in some mammals (1, 5).

The role of dopamine as a biologically 
relevant component of foraging motivation 
was confirmed in odor-learning experiments 
in which bees were topically treated with 
increasing dopamine amounts and subsequently
showed significantly better learning
profiles compared to controls. Bees treated 
with exogenous dopamine also demonstrated 
improved memory for recalling learned tasks 
compared to controls. Thus, modulation of 
dopamine concentrations in bee brains is es- 
sential in a variety of foraging tasks, includ- 
ing expectation, motivating individuals, and 
improving their capacity to learn important 
flower signaling such as odor to identify prof- 
itable foraging options (9). An improved un- 
derstanding of the mechanisms that mediate 
expectation, wanting, and motivation in bees 
(and likely many other animals) according 
to individual experience and resulting dopa- 
mine concentrations in the brain could also 
explain why behavioral differences between 
individuals are frequently reported, even for 
individuals from the same hive (10).

Results from the study of Huang et al. indicate
that honey bees share with mammals
common neural mechanisms for encoding 
the wanting of stimuli with positive hedonic 
value. Thus, as anticipated by Darwin 150 
years ago, evolution likely shaped wanting 
mechanisms that improve animal fitness, 
by experiencing pleasure from nutritious 
food, or triggering responses from emo- 
tional stimuli linked to survival brain circuits
(11). Conversely, an imbalance of this
mechanism may have negative effects on fit- 
ness by, for example, promoting pathologies 
such as addiction (1). 

Honey bees and a variety of native bees 
are major pollinators of important crops, 
including almonds, citrus fruits, and many 
vegetables, with both human nutritional and 
economic value in the tens of billions of dollars
a year within the US alone (12). A deeper
understanding of how dopamine modulates 
wanting in these important animals can 
lead to improved ways of motivating bees 
to forage. For example, the queen bee in a 
hive is able to deploy chemical pheromones 
that will capture the attention of young bees 
to entice them to perform certain tasks by 
stimulating dopamine pathways in the brain 
(13), and equivalent stimulations to drive the
wanting systems of forager bees would likely 
benefit pollination services by motivating 
them to visit more flowers. 

As the honey bee continues to be a key 
model for what can be achieved with a 
miniaturized brain (6), understanding the 
neural mechanisms of wanting and the 
evidence of variability among individuals 
(10), depending upon acquired experience, 
is informative for explaining different ani- 
mal behaviors. 


REFERENCES AND NOTES 


1. K. C. Berridge, M. L. Kringelbach, Curr. Opin. Neurobiol.
23, 294 (2013).

2. C. Darwin, The Expressions of Emotions in Man and
Animals (Oxford Univ. Press, 1872).

3. K. C. Berridge, M. L. Kringelbach, Neuron 86, 646 (2015).

4. J. Huang et al., Science 376, 508 (2022).

5. K. C. Berridge, T. E. Robinson, Am. Psychol. 71, 670
(2016).

6. R. Menzel, Nat. Rev. Neurosci. 13, 758 (2012).

7. K. von Frisch, The Dance Language and Orientation of
Bees (Harvard Univ. Press, 1967).

8. M. V. Srinivasan, S. Zhang, M. Altwein, J. Tautz, Science
287, 851 (2000).

9. A. Kantsa et al., Nat. Ecol. Evol. 1, 1502 (2017).

10. L. Chittka, P. Skorupski, N. E. Raine, Trends Ecol. Evol. 24,
400 (2009).

11. J. LeDoux, Neuron 73, 653 (2012).

12. S. G. Potts et al., Summary for policymakers of the
assessment report of the Intergovernmental Science-
Policy Platform on Biodiversity and Ecosystem Services
on pollinators, pollination and food production (IPBES,
2016).

13. K. T. Beggs et al., Proc. Natl. Acad. Sci. U.S.A. 104, 2460
(2007).


ACKNOWLEDGMENTS 
A.G.D. receives funding from the Australian Research Council. 


10.1126/science.abp8609 


MATERIALS SCIENCE 


An adsorbent with flexible nanoscopic pores


Scanning transmission electron microscopy shows the adaptive pores of a zeolite


By Tom Willhammar and Xiaodong Zou 


Nanoporous materials have large surface
areas and well-defined pores at
the molecular scale, making them
attractive as selective adsorbents
and catalysts (1). They can act as
molecular sieves and have garnered
interest because of their potential use as 
energy-efficient adsorbents for gas separa- 
tion and storage applications, such as car- 
bon capture. The adsorption properties of 
some of these materials suggest that inter- 
actions between the nanoporous material 
and the molecules or ions it adsorbs might 
induce a degree of structural flexibility in 
the pores. On page 491 of this issue, Xiong 
et al. (2) describe changes in the pore ge- 
ometry of the industrially important zeo- 
lite ZSM-5 during the gas adsorption-de- 
sorption process. This direct observation 
of structural flexibility in a nanoporous 
material reveals ways to manipulate the 
dynamic behavior and function of these 
materials. 

Zeolites have a nanoporous three-dimen- 
sional structure built from corner-sharing 
silica (SiO₄) and alumina (AlO₄) tetrahedra.
To date, more than 250 different zeolite
structures have been discovered (3),
each with a distinctive pore size and shape. 
However, adsorption experiments in these 
materials have shown that gas molecules 
larger than the expected pore size can en- 
ter the material, which indicates structural 
flexibility of the material associated with 
the so-called “host-guest” interactions. To 
understand the size and shape selectivity 
properties of a material as an adsorbent 
or catalyst, it is important to know how its
structure changes during the adsorption- 
desorption of the gas molecules. 

Structural flexibility caused by physical 
or chemical stimuli has been reported in 
some zeolites (4). This structural flexibility 
can be induced by ion exchange, dehydra- 
tion, or gas adsorption, and the flexibility 
can manifest as a change in the unit-cell 
volume and/or in the symmetry and pore 
size. Diffraction-based techniques, includ- 
ing x-ray, neutron, and electron diffrac- 
tion, have served as important tools for 
studying these structural flexibilities. For 
example, in situ powder x-ray diffraction 
has shown that the unit-cell volume can 
expand by up to 25% upon ion exchange 
(5). The accessible pore window can be 
substantially altered upon dehydration
or ion exchange, which greatly affects the
adsorption and desorption properties (4, 6).
Three-dimensional electron diffraction
(3DED) has also been a powerful tool for
determining the structure of polycrystalline
porous materials (7). Researchers have
used 3DED to locate carbon dioxide (CO₂)
molecules in the zeolite CHA (8). However,
these diffraction-based techniques can
only provide structure information averaged
over the entire measured material.
To obtain a more detailed understanding
of the mechanisms underlying structural
flexibility, it is necessary to visualize the
host-guest interactions at the individual-pore
level.

Recent advances in imaging techniques
with atomic resolution have made it possible
to visualize the guest molecules inside
the voids of porous materials. One such
method is integrated differential phase
contrast scanning transmission electron
microscopy (iDPC-STEM). Researchers
have used iDPC-STEM to observe physically
adsorbed organic molecules and chemically
bound single molybdenum oxide clusters in
the nanopores of zeolite ZSM-5 (9). When
synthesizing zeolites, organic molecules
are often used as a “structure-directing
agent” to help build the three-dimensional
structure of a zeolite. Using iDPC-STEM,
researchers have been able to track the
location of the structure-directing organic
molecules in the zeolite pores, which provides
insights into the formation of the zeolite
material (10). iDPC-STEM imaging has
also been used to observe para-xylene molecules
adsorbed inside the straight channels
of the zeolite MFI (11). This revealed
that the orientation of the para-xylene molecules
in the channel is correlated to the
channel geometry and therefore could be
used to detect host-guest van der Waals interactions
in porous materials (11).

The flexible pores of a zeolite
The nanopores of a zeolite adapt their shape to fit the adsorbed molecules. Using scanning transmission
electron microscopy, Xiong et al. directly observed this structural change at the sub-unit cell scale during the
entire adsorption-desorption process.

Using an in situ transmission electron 
microscopy gas-cell specimen holder, 
Xiong et al. studied the gas adsorption- 
desorption process in zeolite ZSM-5 us- 
ing benzene as a probe molecule (see 
the figure). Their imaging results show a 
reversible local deformation of pore win- 
dows from circular to elliptical upon the 
pressure-induced adsorption-desorption 
of the benzene molecules. The pores are 
stretched by up to 15% in aspect ratio to 
optimize the interaction with the benzene 
molecules and facilitate their diffusion 
into the channels. The pore structural flexibility
is associated with the flexibility of
the Si-O-Si bond angles, and the introduction
of benzene molecules in the pores
reduces such flexibility, as predicted by ab
initio molecular dynamics simulation. The
direct observation of such a delicate in- 
terplay between the zeolite host and guest 
molecules allows a better understanding of 
the mechanisms behind the complex ad- 
sorption-desorption behavior, which will 
help in the design and selection of adsor- 
bents for specific applications. 

Unlike zeolites, metal-organic frame- 
works (MOFs) have a higher degree of 
structural flexibility. Structural changes 
in MOFs can be triggered using different 
guest molecules for different adsorption- 
desorption behaviors, such as reversible 
pore opening-shrinking, or “breathing,” 
and selective gate-opening (12), which has
been studied using x-ray techniques. In
earlier efforts to study the CO₂ adsorption
of MOFs, researchers have used low-dose
cryogenic electron microscopy to observe
CO₂ molecules in the channels of the MOF
ZIF-8 (13). Direct observation of sorption- 
induced structural changes of the MOFs at 
the individual-pore level could be achieved 
by a combination of an in situ gas envi- 
ronment and low-electron dose imaging 
to prevent electron beam damage to the 
MOFs. This knowledge is important for the 
design of MOFs where structural flexibility 
can be used to tailor adsorption properties 
through chemical or physical interactions 
between the host and guest. 

The possibility of visualizing local 
structural changes induced by adsorbed 
molecules during an entire adsorption-de- 
sorption process should serve as an inspi- 
ration for applications of in situ electron 
microscopy on other flexible nanoporous 
materials beyond zeolites and MOFs. 
Direct observation of guest molecules in 
the channels using iDPC-STEM imaging 
will provide an understanding of host- 
guest interactions in all those nanoporous 
materials, which is important for develop- 
ing efficient adsorbents and catalysts with 
enhanced selectivity and performance.


INFECTIOUS DISEASE


Resurgence of avian influenza virus 


Unprecedented outbreaks of the H5N1 highly pathogenic avian influenza virus raise concern 


By Michelle Wille and Ian G. Barr


Highly pathogenic avian influenza
viruses (HPAIv) have recently been 
detected on the East coast of Canada 
(December 2021) and the United 
States (January 2022) and have been 
found in both wild birds and poultry 
(1). This comes after the same strain (known 
as H5N1) swept through Asia, Africa, and 
Europe in late 2021, replacing the previous 
HPAIv and causing widespread outbreaks 
and millions of deaths in poultry and wild 
birds. These HPAIv are of concern not only 
to birds but also to humans because they 
pose a potential pandemic risk. How has 
this virus emerged and spread so rapidly, 
and what does it mean for poultry, wild 
birds, and humans?

Influenza A viruses infect humans and
a wide range of animal species such as 
pigs, poultry, and many wild birds, includ- 
ing waterbirds, the natural hosts of these 
viruses. Influenza A viruses are classified 
based on differences in their two most abun- 
dant surface proteins, hemagglutinin (H) 
and neuraminidase (N), of which there are 
16 and 9 different subtypes, respectively, that 
infect avian species; these occur in differ- 
ent combinations, defining the virus (e.g., 
H5N1). Wild birds are infected with low- 
pathogenic avian influenza viruses (LPAIv), 
which they can carry asymptomatically. Only 
two H subtypes (H5 and H7) have repeatedly 
emerged as HPAIv to date, and these usually 
occur when LPAIv H5 and H7 viruses cross 
from wild birds into poultry, where changes 
in the H protein transform them into HPAIv,
causing severe disease, devastating out- 
breaks, and up to 100% mortality in chickens 
(2). Although the N subtype may play some 
role in virus transmissibility, it does not determine
disease severity.

The current circulating lineage of H5 HPAIv
had its beginnings in Asia as early as 1996 
as the goose/Guangdong lineage (gs/GD). 
After the emergence of a novel lineage and
associated outbreaks in 2003, it spread to 
Asia, the Middle East, Africa, and Europe, 
where >400 million chickens, turkeys, and 
domestic ducks were culled and >600 human 
cases of H5N1 HPAIv were recorded within 
5 years (3). These viruses became endemic 
in the poultry of many countries in Asia and 
Africa, with the gs/GD lineage diversifying 
into several genetic clades as a result of high 
infection rates, vaccination pressure, and 
circulation of this virus in an array of avian 
hosts. Since 2014, after evolution of the H 
gene, a new gs/GD lineage designated 2.3.4.4 
has emerged, and these HPAIv have subse- 
quently dominated outbreaks globally. 

Why has the H5N1 subtype combination
become globally dominant again? The rea- 
sons for the current 2021-2022 wave of H5 
HPAI are likely to be multifactorial. The reversion
back to using N1 is probably serendipitous,
but the current H5N1 virus clearly has
a fitness advantage over the previous H5N6 
or H5N8 HPAIv that predominated from 2014 
to 2021. This has allowed H5N1 to rapidly 
displace other H5Nx viruses. Normally such 
a dramatic global sweep would be caused by 
a major change in the influenza H protein, 
but the H of the current H5N1 is similar to 
that of the previous H5N6 and H5N8 viruses, 
falling into the same genetic lineage (2.3.4.4). 
This leaves properties in the N1 or changes 
to other viral genes through reassortment or 
mutation as possible reasons for increased 
fitness. However, there may be other host 
factors leading to this spread, such as being 
able to infect a broader range of wild birds 
or reaching higher viral loads in birds, caus- 
ing larger and more intense outbreaks. These 
matters need further, careful investigation. 

How has the rapid spread of H5N1 HPAIv 
occurred? Although HPAIv often emerge in 
high-density poultry production systems, it 
is wild birds that can spread these viruses 
globally. The role of wild birds in spreading 
HPAIv H5 before 2014 was unclear. However, 
in late 2014, wild waterfowl spread HPAIv 
H5Nx from their breeding areas in Siberia to 
Europe and North America (4) and to Africa 
in 2017 along their established migratory fly- 
ways. Wild birds have now reintroduced H5 
HPAIv into North America with detections in 
wild birds: great black-backed gulls (Larus 
marinus) in St. John’s, Canada (November to
December 2021) (4) and wild birds as well as
backyard and commercial poultry flocks in
the United States from January 2022 (1) (see
the figure). H5Nx HPAIv have now been introduced
into all continents except Australia,
South America, and Antarctica, which have 
probably been spared because of ecological 
barriers to waterfowl migration from areas 
of endemic circulation; however, this situa- 
tion could change. 

What are the consequences of the H5N1 
HPAIv spread? Although wild birds are able 
to spread these viruses through migration, 
the viruses may also cause their own mass 
mortality events. These mortality events in 
wild birds come with substantial conser- 
vation concern. For example, H5N1 HPAIv 
caused the death of hundreds of red knots 
(Calidris canutus) in the Netherlands in 2021 
(5), 10,000 migratory common cranes (Grus 
grus) in Israel in December 2021 (6), ~10% of 
the Svalbard breeding population of barnacle 
geese (Branta leucopsis) in late 2021 (7), and 
hundreds of Dalmatian pelicans (Pelecanus 
crispus) in northern Greece in 2022. 

Outbreaks of H5Nx HPAIv have also had 
a substantial impact on poultry production 
and cost the sector billions of dollars since 
their emergence. Data reported to the World 
Organisation for Animal Health (OIE) found 
that between 2005 and 2019, there were 
18,620 outbreaks in poultry reported across 
76 countries. In 2020-2021 alone there have 
been more than 3000 HPAIv events reported 
(8), with ~15,000,000 poultry losses (died 
or culled) globally as a result of H5Nx. The 
United Kingdom has experienced its biggest 
outbreak due to H5N1 HPAI, with millions 
of poultry culled; there have also been 2804 
outbreaks in Europe since October 2021 (up 
to 23 March 2022) (8, 9). 

Onward transmission of HPAIv to hu- 
mans is an ongoing threat that continues to 
be of pandemic concern. From 2003 to 2021, 
there have been 863 human cases of H5N1 
HPAI, resulting in 456 deaths reported to 
the World Health Organization (WHO), with 
virtually all of these cases occurring before 
2016. Although this earlier “classic” strain of 
H5N1 caused severe infections in more than
half of those infected, the number of people 
infected was relatively low given that mil- 
lions of birds and hundreds of thousands 
of poultry workers, farmers, and live-bird
market-stall holders were in direct contact 
with infected birds. Furthermore, these in- 
fections were almost exclusively associated 
with contact with infected birds, with no sustained
onward human-to-human
transmission detected, confirm- 
ing that these viruses were not 
well-enough adapted to humans 
to be a major concern. The more 
recent 2.3.4.4 H5 HPAIv have 
produced similarly low rates of 
human infections, possibly be- 
cause these viruses share the 
same ancestral gs/GD lineage. 
However, different pathogenicity 
has been observed when human 
H5 infections do occur, with the 
earlier H5N1 and H5N6 HPAIv 
appearing more severe and caus- 
ing more deaths in humans than 
H5N8 viruses. Notably, all these 
H5Nx HPAIv strains remain 
highly transmissible and deadly 
to poultry and, to a lesser extent, 
wild birds. 

H5N8 2.3.4.4 HPAIv have 
resulted in very few human 
cases despite their high circula- 
tion from 2014 to 2021. Several 
mild H5N8 infections were 
detected in December 2020 in 
poultry workers in Astrakhan 
Oblast (Russian Federation), 
none of whom required hospitalization
(10). Three human
infections with H5N? HPAIv 
(probably N1) were reported 
in Nigeria in March 2021, but
it is unclear if these were true
infections or environmental carriage of virus
because all cases were asymptomatic.
By contrast, between 2014 and 16 March
2022, H5N6 has caused 77 human infections
with 32 deaths, with all but one case
occurring in China. Of recent concern has
been the apparent increase in H5N6 HPAIv
human cases in China in 2021, where 33
cases (out of 76 total cases) with 11 deaths
were reported, along with a further 17 cases
with at least five deaths in 2022 [up to 16
March 2022 (11)]. These recently increased
numbers may be due to enhanced respiratory
and pneumonia surveillance during
the COVID-19 pandemic, but the situation
needs to be closely monitored. Reassuringly,
no onward human-to-human transmission
has been detected from any of these H5N6
or H5N8 HPAIv infections. To date, only a
single human case with the recent 2.3.4.4
H5N1 HPAIv has been reported: a 79-year-old
man in the United Kingdom who was
infected from local ducks in mid-December
2021 but who remained asymptomatic (12).

Spread of H5Nx avian influenza
The H5Nx (2.3.4.4) highly pathogenic avian influenza viruses (HPAIv) have spread
globally since 2014, with unprecedented outbreaks of H5N1 infections in poultry
and wild birds in 2021 and 2022, with some spillover human infections. Wild-bird
migratory flyways can be linked with the occurrence of these outbreaks (arrows).
The maps are simplified representations of avian HPAIv lineage 2.3.4.4 outbreaks from the Emergency
Prevention System for Animal Health (EMPRES) of the Food and Agriculture Organization of the United
Nations. See text for references to human cases.

What would raise the concern for H5N1
HPAIv for humans? A recent European assessment
(13) determined that the risk of
infection with H5Nx for the general population
in Europe was low, and for occupationally
exposed people, the risk was low to
medium (but with high uncertainty) (14).
Further adaption would need to occur in this
current virus to increase its ability to transmit
between humans efficiently, a process
that is only partially understood; although
this has not occurred yet, the persistence of
H5Nx HPAIv increases the likelihood. As with
COVID-19, if this adaption occurred and the
virus became efficient at airborne transmission
between humans, it would be virtually
impossible to control.

What can be done to curb the H5N1
HPAIv threat? Although H5N1 and H5Nx
HPAIv are, at present, a low threat to humans,
it is early in the outbreak and each
potential pandemic threat should be taken
seriously. To gauge the risk, international
organizations such as WHO, OIE, and the
Food and Agriculture Organization of the
United Nations (FAO) all have programs to
investigate HPAIv in humans, poultry, and
wild birds. Furthermore, continued investment
in surveillance of wild birds and poultry
and of humans at the human-poultry
interface is critical. To mitigate the potential
pandemic threat from H5Nx viruses,
WHO continually prepares a stock of those
viruses (and others) that are considered
to be potentially dangerous to
humans, which can be rapidly
accessed for vaccine manufacture
if required. Effective H5
HPAIv vaccines are also important
in protecting poultry, and
their use will reduce spillover
into wild birds and humans.
In 2017, mass poultry vaccination
with a combined H5 and
H7 vaccine was successful in
controlling H7N9 poultry outbreaks
and human infections in
China. After mass vaccination
and the closure of live-bird markets,
H7N9 human infections
dropped from 759 (with 281
deaths) in 2017 to only two cases
in 2018 (15). Notably, countries
that do not use poultry influenza
vaccines and instead rely
on the culling of infected birds
to control HPAIv outbreaks may
now consider the implementation
of a vaccination program,
given the impact of the current
H5N1 outbreak. Additionally,
measures such as reduction
of flock size and density and
avoidance of poultry production
in waterbird-rich areas
have been proposed to prevent
spillovers of HPAIv into wild
birds. The ongoing 2021-2022
wave of avian influenza H5N1
is unprecedented in its rapid spread and
extremely high frequency of outbreaks in
poultry and wild birds and is a continuing
potential threat to humans.


460 29 APRIL 2022 • VOL 376 ISSUE 6592


RETROSPECTIVE


Eugene N. Parker (1927-2022) 


Pioneer of theoretical solar and plasma astrophysics 


By Stuart D. Bale 


Astrophysicist Eugene N. (“Gene”)
Parker, who led the field of solar
physics into the space age, died on 15
March. He was 94. Gene was first to
identify many of the fundamental concepts
of Solar System plasma physics,
including the physics of solar wind accelera- 
tion, coronal heating and magnetic reconnec- 
tion, cosmic ray diffusion, and a conceptual 
structure of the heliosphere and its interac- 
tion with the interstellar medium. His foun- 
dational work on the theory of the solar wind 
led to the flagship NASA mission that bears 
his name: the Parker Solar Probe. 

Gene was born in Houghton, Michigan, in 
1927 and attended Michigan State University, 
where he received his BS in physics 
in 1948. He earned his PhD at the 
California Institute of Technology 
(Caltech) in 1951 working with physi- 
cist Leverett Davis Jr. on problems of 
the interaction of dust with the inter- 
stellar medium. After a brief appoint- 
ment at the University of Utah, Gene 
joined the faculty at the University of 
Chicago, where he remained until his 
retirement in 1995. 

Gene’s early career coincided with 
the beginning of the space age. At 
that time, the prevailing wisdom was 
that interplanetary space was either 
a vacuum with sporadic solar per- 
turbations or a solar atmosphere in 
hydrostatic equilibrium. However, 
subtle clues suggested something 
quite different. German astronomer Ludwig 
Biermann had made a convincing case that 
one component of a bifurcated comet tail al- 
ways pointed away from the Sun, guided by 
a hypothetical “solar corpuscular radiation,” 
whereas the other part was dominated by 
photon pressure and Kepler’s laws. 

Gene recognized that Biermann’s cor- 
puscular radiation was consistent with a 
hydrodynamic, transonic, escaping atmo- 
sphere that he called the “solar wind.” His 
controversial 1958 solar wind theory, a kind 
of velocity-space analog of a rocket engine 
nozzle, was published only after the editor 
of the Astrophysical Journal overruled the 
reviewers. However, the theory provided predictions
for measurements near Earth and
suggested the existence of a “heliosphere,” 
a plasma bubble surrounding the Sun with 
a discontinuous interface to the interstel- 
lar medium. The general framework of the 
model was verified by Soviet and American 
spacecraft measurements in one of the early 
scientific triumphs of the space age. 

A 1960 Space Science Board report (the so- 
called Simpson Report) recommended both 
a solar probe to investigate the origins of 
Gene’s solar wind and an outer Solar System 
probe to study the interaction of the helio- 
sphere with the interstellar medium. Just 
2 years after the founding of NASA, Gene's 
work was already setting the goalposts for 
major space mission objectives. 


Gene went on to define, and often resolve, 
many of the major problems of solar astro- 
physics. He proposed the concept of mag- 
netic buoyancy to describe the origin and 
dynamics of sunspots, and he pioneered the 
models of cosmic ray diffusion and transport. 
He made major contributions to the theory 
of magnetic reconnection, the process that 
triggers solar flares and magnetospheric 
substorms. He elucidated the nature of the 
Maxwell stress tensor, showing that the low- 
est energy states are sheets of plasma cur- 
rents. The nanoflare model that Gene pro- 
posed to explain solar coronal heating is still 
used today. In a major contribution to tur- 
bulent dynamo theory, he showed that a po- 
loidal magnetic field arises directly from the 
dynamics of a toroidal field, an insight that 
completed the alpha-omega dynamo theory
that explains the origin of planetary and astrophysical
magnetic fields. Gene wrote four
books on topics of astrophysical plasmas, 
each in his trademark dialectical style. 

In 2010, NASA selected my team to design 
and build scientific instrumentation for the 
mission to investigate the origin and accel- 
eration of the solar wind. The solar probe of 
the 1960 Simpson Report was finally coming 
to fruition. I asked Gene to join our team as 
a senior adviser, and he happily accepted 
the role. “My advice is free and worth every
penny of that,” he quipped. As we waded
through theories to define our measurement 
requirements, he urged us to keep it simple. 
He stood up at one of our early meetings to 
remind us thoughtfully that “hydrodynamics 
and magnetohydrodynamics are fundamen- 
tal, regardless of popular opinion.” 

Our Solar Probe Plus spacecraft—renamed 
the Parker Solar Probe in Gene’s honor— 
launched in August 2018. It was the first 
time that a NASA spacecraft had carried the 
name of a living scientist. The Parker Solar 
Probe spacecraft has so far made 11 orbits 
of the Sun, diving to within 13 solar radii of 
the solar photosphere. Gene stayed 
abreast of the Parker Probe results as 
we sent him preprints and publica- 
tions. As his health declined, he was 
content to cheer us on from the side- 
lines with the occasional handwritten 
letter. Many of Gene’s predictions and 
theories are being borne out by Parker 
Solar Probe measurements, notably 
the existence of an underlying hydro- 
dynamic wind speed profile—popular 
opinion be damned. 

Gene was known as a supportive 
and inspiring mentor to his students. 
He was deeply committed to striking 
a good work-life balance; his former 
student Bill Collins recalls him saying, 
“The University of Chicago pays me 
for 40 hours per week and that’s what 
they're going to get.” Clearly, he put that time 
to good use: His many honors include the US 
National Medal of Science, the Crafoord Prize 
awarded by the Royal Swedish Academy of 
Sciences, the Kyoto Prize, the Arctowski Medal 
of the US National Academy of Sciences, and 
the American Physical Society Medal for 
Exceptional Achievement in Research. 

The history of modern solar physics fol- 
lowed the arc of Gene’s career, and his death 
is an opportunity to reflect on both his 
achievements and the state of the field. Solar 
and heliospheric physics are now entering 
a golden age, with extraordinary commit- 
ments from NASA, the US National Science 
Foundation, and the European Space Agency. 
Gene Parker’s many insights played a funda- 
mental role in making that possible. 

10.1126/science.abq3164 
A global system for the next
generation of vaccines 


COVID-19 has shown that hurdles can be overcome 


By Nimalan Arinaminpathy,
Chadi M. Saad-Roy, Qiqi Yang, Isa Ahmad,
Prashant Yadav, Bryan Grenfell


The rapid development of vaccines
against COVID-19 raises questions
about what could be achieved in vaccines
to other major diseases. Influenza
presents an important case study;
it is one of the few infections that
causes substantial public health burden in 
its endemic form while also having proven 
pandemic potential. We offer a first step in 
bringing together the value proposition of 
future influenza vaccines considering two 
key characteristics: the breadth of protec- 
tion that vaccines offer (against individual 
strains, all strains within a subtype, multiple 
subtypes, or all subtypes) and the duration 
for which protection remains effective (see 
the figure). We examine implications of these 
characteristics, from both epidemiologic and 
economic perspectives, and discuss how a fu- 
ture market for influenza vaccines might best 
align public health and economic incentives. 
Although many of these factors are specific 
to influenza, we consider comparisons with 
other major infections, such as tuberculosis 
and COVID-19. 

Current influenza vaccines, mainly tar- 
geting the variable region of the haemag- 
glutinin surface glycoprotein of the virus 
(1), cannot be used to protect a population 
in advance of a pandemic and are only de- 
ployed in routine immunization against 
specific variants of seasonal influenza. By 
contrast, through offering broad protec- 
tion, new vaccines could mitigate equally 
against both seasonal and pandemic in- 
fluenza. Such vaccines would be distinct 
among immunization programs by having 
value in both routine and strategic modes 
of deployment. Moreover, current influ- 
enza vaccines are mostly still produced by 
using the same egg-based methods that 
have been used for roughly the past 70
years. New vaccines would transform these
production methods. For example, both 
mRNA and viral vector technology bypass 
the need for candidate vaccine viruses to 
initiate the production process, requiring 
only the genetic sequence of the target an- 
tigens. Although such vaccines will depend 
on new technologies for their manufacture, 
the unprecedented pace and scale of rollout 
of COVID-19 vaccines have shown that such 
hurdles can be overcome. 


EPIDEMIOLOGICAL IMPLICATIONS 

The basic reproduction number for seasonal
and pandemic influenza (R0), the expected
number of secondary cases produced by a
typical infectious individual in a fully
susceptible population, is typically less than 2,
meaning that even partial vaccination cov- 
erage and efficacy can have strong effects 
in reducing the burden of influenza. It is 
likely that a future influenza vaccine would 
offer less than lifelong protection, necessi- 
tating repeated vaccination over a lifetime. 
Depending on the duration of immunity, 
repeat vaccination may not need to be as 
frequent as every year. If the vaccine targets 
“conserved” antigens that do not evolve as 
rapidly as the variable region of haemag- 
glutinin (the target of current vaccines), the 
antigenic formulation of the vaccine may
not need to be updated as often as with
current seasonal vaccines.
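
To make the link between a low R0 and the impact of partial vaccination concrete, the short Python sketch below uses the textbook homogeneous-mixing approximation, in which a vaccine with a given efficacy, delivered at a given coverage, scales the reproduction number by (1 - coverage x efficacy). This approximation and the function names are assumptions introduced here for illustration; they are not the models used in the cited work.

```python
def effective_r(r0, coverage, efficacy):
    """Reproduction number after vaccination, assuming homogeneous mixing
    and an all-or-nothing vaccine (illustrative assumption)."""
    return r0 * (1.0 - coverage * efficacy)


def critical_coverage(r0, efficacy):
    """Coverage at which the effective reproduction number falls to 1.

    A result above 1.0 means the threshold cannot be reached by
    vaccination alone under these assumptions.
    """
    if not 0.0 < efficacy <= 1.0:
        raise ValueError("efficacy must be in (0, 1]")
    if r0 <= 1.0:
        return 0.0  # transmission already cannot sustain itself
    return (1.0 - 1.0 / r0) / efficacy


if __name__ == "__main__":
    for r0 in (1.3, 1.5, 2.0):
        needed = critical_coverage(r0, efficacy=0.6)
        print(f"R0={r0}: coverage needed at 60% efficacy = {needed:.2f}")
```

Under these assumptions, the required coverage rises steeply as R0 grows, which is one way to see why modest coverage can already matter for an infection with R0 below 2.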

Previous modeling work has illustrated 
that if the duration of immunity and the fre- 
quency of immunization are sufficiently high 
that population immunity is allowed to accu- 
mulate over time, a future influenza vaccine 
could exert greater reductions in seasonal 
influenza burden than that of a conventional 
vaccine with the same coverage and efficacy 
(2). Multiyear cohort studies of vaccinated 
versus unvaccinated individuals would pro- 
vide valuable data on the duration of vaccine- 
induced protection that will be important in 
planning future vaccination programs. 
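
The idea that population immunity can accumulate when protection outlasts the interval between campaigns can be illustrated with a toy calculation. The Python sketch below is not the model of reference (2); it assumes an annual campaign that reaches a random fraction of the population regardless of immune status, and immunity that is lost with probability 1/duration each year. All names and parameter values are hypothetical.

```python
def immune_fraction_after_campaigns(coverage, efficacy, duration_years, n_years=200):
    """Iterate a toy annual-vaccination model and return the immune fraction
    measured just after the final campaign."""
    if duration_years < 1:
        raise ValueError("duration_years must be at least 1")
    waning = 1.0 / duration_years   # assumed yearly probability of losing immunity
    take = coverage * efficacy      # fraction of non-immune people newly protected
    immune = 0.0
    for _ in range(n_years):
        still_immune = (1.0 - waning) * immune              # immunity carried over
        immune = still_immune + take * (1.0 - still_immune)  # this year's campaign
    return immune


if __name__ == "__main__":
    for duration in (1, 2, 5):
        level = immune_fraction_after_campaigns(0.5, 0.6, duration)
        print(f"duration {duration} y: immune fraction {level:.2f}")
```

With a 1-year duration the immune fraction never exceeds coverage times efficacy, whereas longer durations let it build toward a higher plateau at the same coverage and efficacy.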


On breadth, although an ideal vaccine 
would be effective against all subtypes of 
influenza A, in practice it has proven chal- 
lenging to achieve such universal breadth of 
protection. Previous work has shown how 
infection-induced, T cell-mediated hetero- 
subtypic immunity could protect against 
severe disease from pandemic influenza 
in humans (3). Several vaccine candidates 
aim to raise T cell immunity against targets 
such as the nucleoprotein and the matrix 
protein [for example, (4)], which are con- 
served across influenza subtypes. 

On the other hand, recent work has focused
on antibody-mediated immunity
against the haemagglutinin stalk, which, 
being critical for the stability of the overall 
haemagglutinin molecule, shows far less 
variability than the “head” region [for ex- 
ample, (5)]. However, stalk-based immunity 
is not universal: Because H1 and H3, the two 
subtypes of influenza A currently causing 
seasonal influenza in humans, belong to dif- 
ferent clades, stalk-reactive antibodies to H1 
may offer only limited cross-protection to H3, 
and vice versa. Vaccinating against H3 may 
therefore open an ecological niche, allowing 
greater epidemics of H1 than currently occur, 
and vice versa. Analogous effects have been 
observed for pneumococcus, in which wide- 
spread vaccination uptake against a limited 
set of serotypes has typically been followed 
by an increase in carriage of nonvaccine sero- 
types (although not necessarily accompanied 
by increases in disease) (6). 

Overall, for any future influenza vaccine 
offering less-than-universal protection, it 
will be critical to adopt complementary vac- 
cination strategies to close the “protection 
gap” (see the figure). For example, a future 
vaccination program could combine a stalk- 
based H3 vaccine with a conventional H1 
vaccine, or a combination of stalk-based H1 
and H3 vaccines. 


ECONOMIC IMPLICATIONS 

Although improved breadth and duration of 
protection both have positive implications 
for epidemiological impact, they could act 
in opposing directions in the context of 
manufacturing and distribution (see the fig- 
ure). A future influenza vaccine that offers 
broad protection could have higher global 
demand than that of current influenza vac- 
cines, giving rise to a substantially expanded 
market for manufacturers to serve. If such 
a vaccine does not need to be updated as 
frequently as current vaccines, as a result 
of targeting more conserved influenza antigens
than haemagglutinin, this would also
simplify the manufacturing process (7).


Economic and epidemiologic considerations
Boxes show key factors associated with improving vaccine performance in terms of duration (vertical axis) and breadth of protection (horizontal axis).

However, although longer-duration im- 
munity could have important public health 
benefits, it could also pose important con- 
siderations for manufacturers. For example, 
comparing a vaccine that needs to be ad- 
ministered every year against one that is 
only required every 5 years, over time the 
former would have five times the potential 
market as the latter. This simple compari- 
son could be moderated by any increase in 
demand associated with longer-duration 
immunity; understanding such effects will 
be important in planning for manufactur- 
ing and distribution of future vaccines. 
Vaccines conferring longer-duration im- 
munity will also require careful planning 
in their manufacturing and deployment. 
As an extreme but illustrative example, if a 
country’s immunization program involves 
mass vaccination every 5 years, this would 
naturally lead to boom-and-bust cycles in 
demand; in intervening years, manufactur- 
ers will need to repurpose production lines 
for other vaccines. Where different coun- 
tries follow different immunization pro- 
grams, this could lead to demand volatility, 
potentially compromising sustainability 
for manufacturers. Alternative immuni- 
zation schedules could help to smooth 
demand over time, such as “cohort” vac- 
cination strategies focused each year on 
5-year-olds, 10-year-olds, and so on. Such 
strategies are routinely used for childhood 
vaccines such as DTaP against diphtheria,
tetanus, and pertussis. 
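
The demand arithmetic in this section can be sketched in a few lines of Python. The example assumes a fixed population with equally sized age cohorts and full uptake, which are simplifications introduced here for illustration; the function names are hypothetical.

```python
def mass_campaign_demand(population, interval_years, n_years):
    """Doses per year when everyone is vaccinated in a single campaign
    held once every interval_years."""
    return [population if year % interval_years == 0 else 0
            for year in range(n_years)]


def cohort_demand(population, interval_years, n_years):
    """Doses per year when one of interval_years equally sized cohorts
    is vaccinated each year (assumes a uniform age distribution)."""
    return [population // interval_years for _ in range(n_years)]


if __name__ == "__main__":
    people, years = 1_000_000, 10
    annual = mass_campaign_demand(people, 1, years)
    every_five = mass_campaign_demand(people, 5, years)
    smoothed = cohort_demand(people, 5, years)
    print("annual vaccine, total doses:  ", sum(annual))
    print("5-year vaccine, total doses:  ", sum(every_five))
    print("5-year mass campaign by year: ", every_five)
    print("5-year cohort strategy by year:", smoothed)
```

Over any 10-year window the annual vaccine requires five times as many doses as the 5-year vaccine, and the cohort schedule delivers the same 5-year total as the mass campaign but in equal yearly amounts rather than in spikes.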

Other approaches to address demand 
volatility could include centralized purchas- 
ing mechanisms to coordinate and stag- 
ger vaccine procurement among different 
countries, helping to create a market that 
remains stable over time. Examples of such 
mechanisms include the Pan American 
Health Organization (PAHO) revolving fund 
and the United Nations Children’s Fund 
(UNICEF) Supply Division, which acts on 
behalf of Gavi, the Vaccine Alliance. 


GLOBAL PRODUCTION CAPACITY 

At present, production of influenza vaccines 
is heavily concentrated in high-income settings.
Manufacturers in high-income countries,
which account for only 16% of the global
population, produce an estimated 69%
of seasonal influenza vaccine production 
and 80% of potential pandemic vaccine 
production (8). These disparities mirror 
global imbalances in the uptake of influ- 
enza vaccines: Although the United States 
has among the world’s largest coverage of 
routine influenza vaccination as a propor- 
tion of its population, uptake is much more 
limited in India and other low- and middle- 
income countries (LMICs). Reasons for 
such low uptake (9) include lack of aware- 
ness of the disease burden of influenza, as 
well as the need to vaccinate annually, and 
the cumbersome process of deploying vaccines
that must be reformulated regularly
as the virus evolves. It is likely that a future,
broadly protective vaccine, with similar ef- 
ficacy to that of current vaccines but offer- 
ing multiyear protection while not needing 
to be reformulated so often, would address 
some key issues for uptake in LMICs. 

Alongside increasing uptake to facilitate 
the emergence of a global market, we ar- 
gue that there will be several benefits to 
promoting regional production to meet 
local needs while leveling current imbal- 
ances in global production capacity. From 
an epidemiological perspective, future in- 
fluenza vaccines could be used strategically 
for pandemic preparedness and response. 
As demonstrated during the COVID-19 
pandemic, global supply of pandemic vac- 
cination is not always aligned with global 
need because countries prioritize coverage 
among their own populations. If increased 
future demand for new influenza vaccines 
facilitates the establishment of regional 
production capacity to meet local needs, 
this distributed capacity will be valuable 
in ensuring that all countries have the 
necessary access to vaccines for pandemic 
response. This benefit need not be limited 
to influenza pandemics, with new technolo- 
gies such as mRNA and viral vectors be- 
ing applicable to other infections as well. 
As long as regional vaccine manufacturing 
sites are sufficiently flexible, they could be 
deployed rapidly against a wide array of 
emerging infections (10). New initiatives al- 
ready underway to expand manufacturing 
capacity for COVID-19 and other vaccines 
in Africa and elsewhere could represent im- 
portant steps toward equitability in global 
pandemic preparedness. 

From an economic perspective, for both 
seasonal and pandemic vaccination, an ex- 
panded market with the right number of 
manufacturers allows production to benefit 
from economies of scale (11), potentially allowing
lower vaccine prices at least for the
poorest countries. A large, globally distrib- 
uted market could establish a virtuous cycle 
by reducing local costs of broadly protec- 
tive vaccines, which in turn could facilitate 
increased uptake (and consequently com- 
mercial attractiveness). Gavi exerted such 
effects on the global market for the pen- 
tavalent vaccine: Their coordinated market- 
shaping initiatives led to more manufac- 
turers entering the market, which in turn 
reduced vaccine prices (12). 

A much-expanded market could also 
lower costs by facilitating entry of new 
manufacturers. Currently, only a few low- 
or lower-middle-income countries have 
capacity to produce vaccines that meet 
stringent quality standards [approved by 
stringent regulatory authorities or being
World Health Organization (WHO)-prequalified].
Use of newer platforms such
as recombinant proteins, adenovirus vectors,
or lipid-encased mRNAs is largely
limited to India and China (13). Developing
such manufacturing capacity in other coun- 
tries requires intensive “soft” infrastructure 
in the form of chemistry, manufacturing, 
and control (CMC) staff and “hard” infra- 
structure in the form of new equipment for 
manufacturing and laboratory and analyti- 
cal testing. New manufacturers who invest 
in this infrastructure are likely to operate 
on a smaller scale compared with that of 
manufacturers in India and China and are 
therefore likely to see increased costs of 
production. A much-expanded market could
therefore facilitate the entry of these manu- 
facturers by offsetting these increased costs. 


ROLES FOR DONORS AND POLICY-MAKERS 
Catalyzing the development of such a global 
market is likely to require effective partner- 
ships between industry, policy-makers, and 
academia. There will be a critical role for 
international agencies and donors to play, 
not only in the research and development of 
new vaccines but also in their manufactur- 
ing and distribution. 

Mechanisms such as advance market 
commitments could attract greater industry 
participation in the process of vaccine de- 
velopment. To the extent that new vaccines 
depend on new technologies, they will also 
require new mechanisms for evaluation of 
their performance. Pathways for regulatory
approval will need to be updated to incorporate
these new mechanisms. A global
market will require updates to regulatory 
approval to be coordinated at a global level 
to facilitate the deployment of new vaccines 
worldwide. It is likely that licensure of fu- 
ture vaccines will proceed in steps rather 
than leaps—for example, with breadth and 
duration of protection being steadily im- 
proved with each successive generation of 
vaccine development. Therefore, although 
“target product profiles” that identify ideal 
characteristics for future vaccines would of- 
fer helpful reference points in vaccine de- 
velopment and international regulatory co- 
ordination, it will be equally important for 
donors and international agencies to plan 
how best to recognize and reward interme- 
diate stages toward achieving those ideals. 

Once vaccines are developed and licensed, 
there will be a need for further interventions 
in their distribution, both on the demand side 
and the supply side. Creating a sustainable 
demand side requires mechanisms for long- 
term public funding of vaccine purchases by 
country governments, global coordination of 
influenza vaccines for LMICs, and if needed, 
using pooled procurement structures such 
as UNICEF and the PAHO revolving fund 
for LMIC purchase of influenza vaccines.
On the supply side, there will be a need 
for concessional loans and other forms of 
investment from development finance insti- 
tutions (such as the International Finance 
Corporation) to finance the capital expen- 
diture for setting up new sites in LMICs for 
influenza vaccine manufacturing. 

In addition to plant and equipment and 
technical know-how, vaccine manufactur- 
ing—especially using newer cell-based plat- 
forms and mRNA—requires many complex 
input components, some of which have 
limited sources of supply. A globally coor- 
dinated mechanism is needed to manage 
the material, know-how, and other criti- 
cal resources required for the global mar- 
ket to operate. The WHO Global Action 
Programme initiative offers an example of 
such a coordinated mechanism, in the con- 
text of current influenza vaccines (8). As 
new technology platforms emerge, new co- 
ordination architectures may have to be de- 
signed to accommodate their deployment. 


COMPARISONS WITH OTHER INFECTIONS 
To illustrate how the value propositions 
for future vaccines for other diseases could 
depend on the disease in question, we 
consider two other respiratory diseases, 
COVID-19 and tuberculosis (TB). 

Like influenza, coronaviruses are estab- 
lished in the human population and have 
proven pandemic potential. Although licensed
vaccines against SARS-CoV-2 raise immunity
to the surface spike protein, there
is increasing interest in targeting more 
conserved antigens to raise more broadly 
protective immunity. Although it will be 
challenging to achieve a truly universal 
coronavirus vaccine, recent work has shown 
promising signs of raising broad protection 
against sarbecoviruses [for example, (14)], a 
lineage to which SARS-CoV-2 belongs, along 
with SARS-CoV-1, which first emerged in 
humans in 2003. Owing to the global bur- 
den of SARS-CoV-2, and the need to pro- 
tect against future coronavirus pandemics, 
many of the considerations raised here (see 
the figure) would apply to future corona- 
virus vaccines as well. However, endemic 
coronaviruses make only a modest contribution
to the burden of respiratory illness,
particularly in comparison with seasonal 
influenza; SARS-CoV-2 may, over time, simi- 
larly transition to being an endemic virus 
causing mostly mild illness. Accordingly, 
compared with influenza vaccines, the pub- 
lic health value proposition of future coro- 
navirus vaccines may have a greater empha- 
sis on pandemic protection than on routine 
immunization. However, a sizable burden 
of “long covid” after endemic SARS-CoV-2
infection could alter this balance. 


For TB, the world’s leading cause of death 
from infectious disease, the public health 
case for new, effective vaccines is clear. 
However, with global TB burden overwhelm- 
ingly concentrated in LMICs, the potential 
market for future TB vaccines is not as com- 
mercially promising as for other vaccines. 
Hence, despite a recent phase 2b trial of the
M72/AS01E vaccine showing promising signs
of efficacy (15), it has been challenging to
find industry partners who are willing and
able to take this candidate forward to
phase 3 trials. The Bill and Melinda Gates
Medical Research Institute (Gates MRI) will
support clinical trials of the vaccine, having
recently acquired the license from GlaxoSmithKline.
Even though future vaccines
against TB and other diseases of poverty may 
well benefit from new technologies deployed 
against COVID-19, international public 
health agencies and donors will need to play 
an even more active role than in the context 
of influenza vaccines. 

A globally distributed market for influenza 
vaccines, supported by regional production 
capacity to meet regional need, would be one 
way of aligning public health, supply resil- 
ience, and commercial interests but will not 
necessarily develop on its own; international 
public health agencies and donors will have 
a key role to play in catalyzing the creation
of such a truly global system. 
INSIGHTS


Landscapes of the Anthropocene


The late artist Per Kirkeby’s preoccupation with geology is on display in a new exhibition


By Deborah Dixon 


Geological Messages, the collection on
show at the Michael Werner Gallery 
in London, consists of 24 paintings 
in various mediums by the acclaimed 
Danish artist Per Kirkeby (1938-2018).
Over the years, reviewers have
compared the pieces created by Kirkeby— 
who initially trained as a geologist—with 
various artistic traditions in order to draw 
out how he deals with the materialities of 
light, shadow, substance, and system. But 
this collection asks us to consider Kirke- 
by’s “decades-long preoccupation with the 
world as geologically unstable, ductile, 
and everywhere moving between states of 
mineral complexity and processes 
of collapse” (1). 

Geologists have deployed a wide
array of visualization techniques,
from sketches and lithographs to
models and scans, to investigate
the transformation of landforms.
Such techniques animate a series
of aesthetics specific to geology and include
a mapping of surface-subterranean
relationships, the forensic examination of
grain-scale characteristics for signs of cosmic
import, and the slice through deep time
that is the geological cross section.


Geological Messages:
Paintings from 1965-2015
Per Kirkeby
Michael Werner Gallery,
London, UK,
through 21 May 2022


Geological cross sections come to life in Geologische Nachrichten (Geological Messages), painted in 1999.


The earlier works on display in this exhibition
are mixed media on Masonite, a
material formed by steaming, stretching,
and pressing wood fibers. We see in these
paintings the exposure of the subterranean
in striated sandstone buttes eroded by water
and wind, and in caves dissolved into
karst by rainwater percolating downward.

This focus on the exposure of the depths
of Earth has a long geological lineage. Kirkeby’s
1967 painting Dunkle Hohle (the Dream
about Uxmal and the Unknown Grottos
of Yucatan), which looks outward from a
limestone grotto, echoes, for example, the
sketched excavations of Kirkland Cave in
Yorkshire by Victorian geologist William
Buckland (2), whose metaphysics of sequential
extinctions and emergences veered between
creationism and catastrophism.

In Kirkeby’s 1992 oil on canvas painting
Inferno IV, we can find another geological
aesthetic. In place of the tension between
surface form and depth, there appear
granularities and fractures. This
piece echoes the thin-section microscopies
of minerals, ores, and
meteorites where the passage of
white light through the material
reveals various hues and intensities,
as well as a textural world
of shapes, cleavages, facets, and
inclusions. Peering at Kirkeby’s forensic 
grammar of scraped-on paint and scrib- 
bled lines, we can find a narration of tec- 
tonic uplift, eruptions, the appearance and 
disappearance of oceans, and the continual 
dynamism not just of Earth but of the cos- 
mos within which it revolves. 

In Geologische Nachrichten (Geological 
Messages), an oil on canvas painted in 1999, 
it is the geological cross section that comes 
into view. Planes and folds are mapped out 
in block color and serried black outlines. But 
this is a cross section that refuses the careful 
stratigraphy of a Charles Lyell, James Hut- 
ton, or G. K. Gilbert. Kirkeby’s patchwork 
terrains seem to exist in their own separate 
times and spaces, their planar surfaces illu- 
minated by different light sources. 


This strangeness unfolds in Kirkeby’s 
more recent, untitled works, which retain 
lithic components but situate them amid 
organic formations such as tree trunks, 
the grains of which echo fault lines. Across 
these visually lush paintings there are 
branches and leaves, as well as a snake, a 
rabbit, and birds. The mixed media on Ma- 
sonite of these paintings reiterates the mix 
of shapes, colors, and lines that are uncan- 
nily suggestive of undergrowth, crystals, 
swarms, and the fronds of underwater—or 
aerial—life-forms. 

As rock becomes part of the Anthropo- 
cene’s hybrid, strange ecologies, a different 
geological message emerges. The Arctic and 
northern European landscapes that Kirkeby 
references are undergoing ever-quickening 
transformations under anthropogenic cli- 
mate change, including the movement 
of mass materials. Communities that for 
generations have known the alignments of 
rock, vegetation, rivers, and animal life are 
facing increasing uncertainty. 

Kirkeby’s “geological messages” are a 
very welcome reminder of the shared aes- 
thetics and practices that have animated 
geology and art, as well as the planetary 
and cosmic narratives that geology gives 
us. For audiences now, these messages 
seem to be telling the time of the Anthro- 
pocene, not as another stratum that over- 
lays an earthly archive built from stone but 
as a condition of estrangement and aliena- 
tion that requires new ways of connecting 
with Earth. 
HISTORY OF SCIENCE


Rethinking the “Western” revolution in science 


A historian sees global cultural and geopolitical roots in Europe's scientific breakthroughs 


By Jorge Cañizares-Esguerra


Historian James Poskett sees modern
science as the product of ceaseless
cross-cultural, geopolitical transformations.
In his new book, Horizons,
Poskett argues that the so-called
“Western” revolution in science was
actually the result of global transcultural 
and transregional interactions, all triggered 
by global socio-cum-geopolitical shifts. He 
brings his model of transculturation and 
global geopolitical transformations to bear on
nearly every major breakthrough in modern 
science, from Newton’s physics to Lavoisier’s 
chemistry to Mendel’s genetics. The result is 
a bold new interpretation of the history of 
the field. 

Take, for example, the work of Coperni- 
cus. Poskett locates Renaissance astronomy 
within the much larger context of the 
expansion of Islam in North Africa 
and Eurasia. Islam, he reveals, de- 
manded great accuracy in the obser- 
vations of solar, lunar, and planetary 
movements, which were used to or- 
ganize ritual calendars and sanction 
political power. The fall of Constan- 
tinople in 1453 triggered an influx 
of texts, exiles, and learned captives 
into Europe, and Copernicus drew on 
these resources when formulating his 
heliocentric theory of the Universe. It 
was therefore, in Poskett’s estimation, 
the combined effort of many scholars, 
rather than the work of a lone genius, 
that led to the demise of Aristotelian 
and Ptolemaic geocentric models. 

Similarly, Poskett demonstrates 
how all the key evidence Isaac New- 
ton relied on to revitalize physics 
came from comparative studies con- 
ducted in equatorial and Arctic loca- 
tions. To reach isolated islands in the 
Pacific to obtain such data, nations 
needed considerable seafaring capa- 
bilities. Ultimately, Poskett argues, 
it was the Atlantic slave trade that 
made the accumulation of evidence 
for Newtonian physics possible. 

According to Poskett, 19th-century 
industrialization, nationalism, settler 
colonialism, and imperialism drove the development
of evolutionary biology, particularly
Darwinian natural selection. The idea
of evolution as the survival of the fittest was 
a trope that informed the development of 
national armies and frontier societies. Nine- 
teenth-century Argentina, Russia, Japan, 
and China, he notes, excelled in paleontology 
and evolutionary biology. 

It was the pursuit of communication in 
scattered imperial polities that led to break- 
throughs in telegraphy and radio, particu- 
larly in Russia, Japan, and China, argues 
Poskett. The rush to industrialization in the 
19th century, in turn, sparked much research 
in chemistry. During this period, the Russian 
chemist Dmitri Mendeleev spearheaded in- 
quiries into the periodic table, and the Japa- 
nese physicist Hantaro Nagaoka headed up 
efforts that led to the first model of the atom. 


Islamic astronomy informed the work of Copernicus, argues Poskett. 


Horizons: 

The Global Origins 
of Modern Science 
James Poskett 
Mariner Books, 2022. 
464 pp. 


Poskett sees Marxist ideologies and Cold 
War conflict as two of the main forces be- 
hind the global spread of relativity, quantum 
mechanics, and genetics. The new physics 
advanced by Einstein and Bohr was received 
by Bolshevik radicals in Russia as a welcome 
challenge to bourgeois complacency, he ar- 
gues. Nationalists and republican radicals in 
India and China also embraced these ideas 
with gusto. Meanwhile, the fear of famine 
and atomic warfare during the Cold War 
greatly facilitated the spread of population 
and agricultural genetics in Mexico, 
China, and India. 

Poskett’s main contribution with 
this book is to demonstrate that “Eu- 
ropean” knowledge has long been 
the result of global efforts and that 
science is intimately attached to co- 
lonialism, capitalism, slavery, indus- 
trialization, and geopolitical conflict. 
Poskett offers countless examples of 
non-European scientists whose re- 
search changed the sciences in radi- 
cal new ways. 

Yet, for all his sharp insights, Pos- 
kett remains firmly grounded in Eu- 
rocentric teleologies. In his account, 
science moves through familiar Eu- 
ropean markers of progress: Coper- 
nicanism, Newtonianism, Linnaean 
natural history, Maxwellian electro- 
magnetism. And nearly all the non- 
European innovators he highlights 
first trained in either European or US 
institutions. Nor does he question the 
Eurocentric assumptions that underlie 
the historiographical categories that 
are traditionally used to organize the 
history of science (the print revolu- 
tion, the Republic of Letters, the public 
sphere, the Enlightenment, democracy, 
and the Industrial Revolution). Despite 
these shortcomings, Horizons is nev- 
ertheless a challenging book that de- 
serves a wide readership.
Edited by Jennifer Sills


Retraction 


We would like to retract the Report 
“Asymmetric remote C-H borylation of 
aliphatic amides and esters with a modular
iridium catalyst” (1) after discovering
that the reported enantioselective
γ-selective C-H borylation of N,N-
dibenzylhexamide (1a) is not reproducible
and that many of the nuclear magnetic
resonance (NMR) spectra were manipulated.
For example, the signal patterns of
the ¹H NMR spectra of boronate (R)-2b
(4-boryl-N-tert-butylhexamide) and (R)-2g
(tert-butyl 4-boryl-hexanoate) (S129, S144)
are identical, the baselines of the ¹³C NMR
spectra of (R)-2b and (R)-2g are identical
(S130, S145), duplicative signals are found
in the ¹H NMR spectrum of boronate
(R)-2f (ethyl 4-boryl-hexanoate) (two triplets
in 0.7 to 0.9 ppm and two multiplets
in 2.2 to 4.1 ppm) (S141), and the baseline
of (R)-2f (S142) is also identical to those 
of (R)-2b and (R)-2g. These issues under- 
mine our confidence in the integrity of the 
study as a whole. We regret any confusion 
and apologize to the scientific community. 
All authors have agreed to retract this 
Report, and an institutional investigation 
of misconduct is underway. 
Evolutionary risks of
osprey translocations 


The osprey Pandion haliaetus is a mostly 
migratory, cosmopolitan raptor (1) that
was historically extirpated or highly 
depleted in southern Europe. Because the 
species is charismatic, it has garnered 
public attention that has stimulated 
broad conservation action, fundraising, 
and political commitment. Osprey trans- 
location programs have been a primary 
conservation strategy in Europe since 
1996 (1). However, because evidence suggests
that translocation programs that
move individuals without accounting for 
local adaptations between populations 
have negative short- and long-term effects 
(2-5), this approach should no longer be 
used to meet conservation goals. 
Translocations over the past several 
decades have contributed to increases in 
southern and western osprey populations 
(1). However, in most cases source popu- 
lations were from distant geographical 
areas of central and northern Europe (6), 
from which native Mediterranean osprey 
populations differ in both genetics (6) and 
migratory behavior (6-8). Whereas native 
Mediterranean populations are residents 
or short-distance migrants within the 
Mediterranean basin (7), breeding birds 
from reintroduced populations in Spain 
and Portugal maintained a long-distance 
migratory pattern, similar to birds from 
the donor populations (8). 
Reintroductions that are not consistent 
with the evolutionary history of a spe- 
cies, and that will likely alter the species’ 
future evolution, could prove detrimental 
to conservation (2-5). Evidence suggests 


An osprey chick from an endangered population sits in a nest on the rocky cliffs of Corsica. 


that mortality of translocated animals, 
mixing with neighboring local populations 
(2), and changes in migratory and dis- 
persal behavior (2, 5, 8) are likely to take 
place in the short term. Long-term effects 
could include genetic homogenization 
that annihilates part of the genetic diver- 
sity of the species at broader spatial scales 
(2-4) and decreases in the species’ ability 
to adapt to climate change. 

In June, the local government of 
Generalitat Valenciana plans to release 
near Valencia, Spain, large numbers of 
ospreys originating in central Europe 
(9). Although the project was conceived 
as a way to improve conservation, it 
instead focuses on the short-term social 
and political benefits associated with the 
return of a totemic species. By prioritizing 
short-term goals, the plan disregards the 
demonstrated risks of such translocations 
and the recommendations of international 
conservation agencies (10-12). 

We warn the scientific community 
and the Spanish authorities against the 
planned actions and recommend appro- 
priate international assessments to ensure 
that all future reintroductions adhere to 
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science-based recommendations. Genetic 
and behavioral processes that occur on 
evolutionary time scales are at least as 
important as those that are readily visible 
at the time scales of most conservation 
actions (10-12). 
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How to weaken Russian 
oil and gas strength 


Oil and gas exports represent Russia’s 

key geopolitical strength, as well as its 
major source of hard currency revenues 
(1-3). At current prices, these are esti- 
mated at around US$1 billion per day (4), 
representing an important lifeline for an 
economy under heavy financial sanctions 
in response to Russian President Vladimir 
Putin’s invasion of Ukraine. The European 
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Union (EU) buys 75% of Russian gas 
exports and 50% of Russian oil exports 
(5). An EU embargo on Russia would sub- 
stantially undermine Putin’s geopolitical 
and economic positions. However, given 
a full embargo’s potential impacts on 

the EU, securing the approval of all EU 
countries is difficult. To limit Russia’s oil 
and gas revenues while keeping up the 
flows, the EU should instead introduce a 
tariff on Russian oil and gas imports. The 
tariff can be adapted to the economic and 
political dynamics of the conflicts. 

A tariff’s effect on domestic prices 
depends on the relative elasticities of 
supply and demand—i.e., on whether 
sellers and buyers have relatively better 
alternatives. The more inelastic the sup- 
ply (e.g., because Russian exports cannot 
be diverted) and the more elastic the 
demand (e.g., because the EU can replace 
Russian supplies), the more of the tariff 
will be paid by the supplier (6). Russian 
oil and gas exports to Europe are inelastic 
in the medium term because infrastruc- 
tural bottlenecks prevent a substantial re- 
direction to Asia. The EU therefore has a 
real chance to ensure that tariff revenues 
are mostly paid by Russia. 
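
The incidence argument above can be made concrete with the standard textbook approximation for a small per-unit tariff in a competitive market: the supplier bears a share equal to the demand elasticity divided by the sum of the demand and supply elasticities (both taken as magnitudes). The sketch below is only an illustration under that assumption; the function name and the elasticity values in the comments are hypothetical and are not estimates from this letter or its references.

```python
def supplier_tariff_share(supply_elasticity: float, demand_elasticity: float) -> float:
    """Approximate fraction of a small per-unit tariff borne by the supplier.

    Uses the textbook partial-equilibrium incidence formula
        share = |e_d| / (|e_d| + |e_s|),
    where e_s is the price elasticity of supply and e_d the price
    elasticity of demand. This is an illustrative assumption, not a
    model taken from the letter.
    """
    e_s = abs(supply_elasticity)
    e_d = abs(demand_elasticity)
    if e_s + e_d == 0:
        raise ValueError("at least one elasticity must be nonzero")
    return e_d / (e_s + e_d)


# Hypothetical values: inelastic supply (exports cannot be diverted)
# and fairly elastic demand (buyers can substitute).
mostly_on_supplier = supplier_tariff_share(supply_elasticity=0.2, demand_elasticity=0.8)

# Hypothetical reverse case: elastic supply and inelastic demand.
mostly_on_buyer = supplier_tariff_share(supply_elasticity=0.8, demand_elasticity=0.2)
```

Under this approximation, anything that raises the buyer's demand elasticity or lowers the seller's supply elasticity shifts more of the tariff onto the seller, which is the direction of the argument in the text.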

To improve its position, the EU needs 
to increase its demand elasticity. This 
can be done by incentivizing a reduction 
of oil and gas demand in Europe and by 
increasing the use of all available alterna- 
tive energy resources. By implementing 
a bold energy strategy, Europe can cred- 
ibly threaten to cut Russia’s oil and gas 
revenue while minimizing the domestic 
economic consequences of a tariff. 
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A global plastic treaty 
must cap production 


In March, the UN Environment Assembly 
adopted a resolution to combat plastic pol- 
lution with a global and legally binding 
plastics treaty by 2024 (1). In his News In 
Depth story “United Nations to tackle global 
plastics pollution” (25 February, p. 801), E. 
Stokstad discusses many of the ambitious 
provisions that were included, such as a 
consideration of the whole plastic life cycle 
and binding targets. However, it is unclear 
whether the treaty will include a cap on pro- 
duction or cover plastic chemicals. Despite 
interventions by the industry (2) and objec- 
tions from the United States and other del- 
egations, reducing plastics at the source by 
curbing production is critical. 

The current mass of plastic production is 
at about 450 million tons annually and set 
to double by 2045 (3). The immense quantity 
and diversity of both plastics and plastic 
chemicals, the total weight of which exceeds 
the overall mass of all land and marine ani- 
mals (4), already poses enormous challenges. 
Ensuring the safety of every available plastic 
and chemical is impossible, as their rates 
of appearance in the environment exceed 
governments’ capacities to assess associated 
risks and control problems (5). Plastic pol- 
lutants have altered vital Earth system pro- 
cesses to an extent that exceeds the thresh- 
old under which humanity can survive in 
the future (i.e., the planetary boundary) (5). 
Because legacy plastics in the environment 
break down into micro- and nanoparticles 
(6), this form of pollution is irretrievable and 
irreversible (6). In addition to the risks for 
human and environmental health, the whole 
life cycle of plastic accounts for 4.5% of our 
current greenhouse gas emissions (7) and 
could consume 10 to 13% of our remaining 
CO₂ budget by 2050 (8). The growing production 
and inevitable emissions of plastics 
will exacerbate these problems (6). 

Failing to address production will 
lead to more dependence on flawed and 
insufficient strategies. Some waste man- 
agement technologies, such as forms of 
thermal and chemical recycling, cause 
socioeconomic and environmental harm 
(9). Much of the plastic waste is currently 
exported from the North to the Global 
South, which poses a substantial threat to 
marginalized and vulnerable communities 
and their environments (10). Even when 
applying all political and technological 
solutions available today, including sub- 
stitution, improved recycling, waste man- 
agement, and circularity, annual plastic 
emissions to the environment can only be 
cut by 79% over 20 years; after 2040, 17.3 
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million tons of plastic waste will still be 
released to terrestrial and aquatic envi- 
ronments every year (11). To fully prevent 
plastic pollution, the path forward must 
include a phaseout of virgin plastic production 
by 2040 (12). 
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MARINE BIOLOGY 


Local adaptation 
in coral reefs 


The Clerke Reef atoll at the Rowley 
Shoals in Western Australia provides 
a model habitat with which to 
study how spatially varying selection 
drives ecological divergence. 
Studying the broadcast spawning coral 
Acropora tenuis, Thomas et al. examined 
individuals from lagoon and reef-flat loca- 
tions through genome and transcriptome 
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sequencing combined with common 
garden heat-stress experiments. Corals 

in the lagoon were shown to be more heat 
tolerant, and the gene expression base- 
line also differed between the two sites. 
Understanding existing genetic variation 
across habitats could help to protect and 
manage reefs in the upcoming decades. 
—MM Sci. Adv. 10.1126/sciadv.abl9185 (2022). 


Spatial variation in lagoon habitats at the 
Clerke Reef atoll has given rise to corals with 


increased tolerance to heat. 


NEUROSCIENCE 
The spinal circuit for 


pleasant touch 


Pleasant touch (e.g., cuddling, 
caressing, and patting) encodes 
positive hedonic information that 
facilitates emotional bonding, 
affiliative behavior, and the well- 
being of social animals. Despite 
its profound importance, how 
pleasant touch information is 
encoded and transmitted from 
sensory neurons to the spinal 


cord remains unknown. Liu et al. 


identified interneurons in lami- 
nae II of the spinal cord dorsal 
horn that express prokineticin 
receptor 2, as well as sensory 
neurons that express the ligand 
prokineticin 2 that are involved 
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in the encoding and transmit- 
ting of pleasant touch. Genetic 
ablation of these neurons 
selectively abolished the place 
preference that mice developed 
in a pleasant touch-condi- 
tioned place preference test. 
However, the sensation of pain 
and itch remained unaffected 
in the mutant animals. —PRS 
Science, abn2479, this issue p. 483 


TUMOR IMMUNOLOGY 
A CAR for killing, 
not toxicity 


Chimeric antigen receptor 
T (CAR-T) cell therapies are 
approved by the US Food 
and Drug Administration for 


treating blood cancers but 

are often limited by toxicities 
and difficult and expensive 
production. Developing 
more-streamlined CAR-T cell 
therapies is crucial. Vasic et 

al. used mouse and human 
double-negative (DN) T cells 
as a CAR-T cell therapy, which 
showed robust efficacy in 
mouse models of blood and 
lung cancers. They found that 
the DN CAR-T cells were just as 
effective as conventional CAR-T 
cells but induced no toxicity. 
The DN CAR-T cells could be 
produced from mixed donors, 
were not rejected from hosts, 
and could be frozen for long 
periods of time, suggesting 
that they could be an attractive 


“off-the-shelf” CAR-T cell 
therapy option. —DAE 
Sci. Immunol. 7, eabl3642 (2022). 


HYDROGELS 
Converting forces 
into ionic signals 


Piezoelectric materials can 
convert mechanical forces into 
electrical signals and are used 
as the active material in many 
pressure sensors. However, 
biological systems tend to be 
based on the movement of ions 
rather than electrons. Dobashi 
et al. looked at the piezoionic 
effect of several kinds of hydro- 
gels. Hydrogels are designed 
so that the anions and cations 
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have a different mobility; thus, 
when the material is squeezed, 
it causes an ionic gradient 
that generates voltage. The 
authors demonstrate sev- 
eral potential applications, 
including a piezoionic skin and 
peripheral nerve stimulation, 
to demonstrate the possibil- 
ity of self-powered piezoionic 
neuromodulation. —MSL 
Science, aaw1974, this issue p. 502 


Self-inflicted damage 
protects tumors 


Genotoxic treatments, such 
as radiation and some chemo- 
therapy drugs, are a mainstay of 
cancer therapy, but they often 
fail to fully destroy tumor cells. 
Normal cells can protect them- 
selves from genotoxic insults 
by activating the G₁ cell cycle 
checkpoint, but this checkpoint 
is often dysfunctional in tumors. 
By contrast, Larsen et al. 
discovered that tumor cells can 
activate a nuclease that causes 
limited induction of DNA breaks 
at specific sites, which is coordi- 
nated with the process of DNA 
break repair. These self-inflicted 
DNA breaks trigger the G₂ cell 
cycle checkpoint, preventing 
tumor cells from cycling and 
protecting them from death due 
to treatment-induced DNA dam- 
age. —YN 

Science, abi6378, this issue p. 476 


Fleeting chromatin loops 
The genome is organized 

into three-dimensional (3D) 
domains that are widely 
thought to be stable, fully 
looped structures, although 
this organization has not been 
directly observed in living cells. 
Gabriele et al. report the direct 
visualization of chromatin 
looping in living cells and used 
Bayesian inference to quan- 
tify looping dynamics. Loops 
were found to be both rare and 
short-lived, overturning static 
models of looping. Instead of 
being fully looped, 3D genome 
domains existed overwhelm- 
ingly in partially folded 




configurations. This more 
dynamic view of 3D genome 
domains may ultimately allow 
a deeper understanding of why 
the disruption of some domains 
and loops causes dysregulation 
of gene expression in disease. 
—DJ 

Science, abn6583, this issue p. 496 


A gamma-ray pulsar 
timing array 
After galaxies merge, the super- 
massive black holes (SMBHs) 
at their centers are expected 
to form binaries that emit 
gravitational waves at nano- 
hertz frequencies. Numerous 
SMBH binaries throughout the 
Universe should combine to 
produce a gravitational wave 
background. Existing searches 
for this signal use radio obser- 
vations of pulsars as sensitive 
clocks and look for small shifts 
in the pulse timings. The Fermi- 
LAT Collaboration implemented 
a pulsar timing array using 
gamma rays and achieved a 
sensitivity close to that of the 
radio approaches. The results 
set an independent upper 
limit on the gravitational wave 
background, which is subject to 
different noise sources. —KTS 
Science, abm3231, this issue p. 521 


Carbon excision 
Quinolines and indoles are both 
very common core motifs in 
drug molecules. Because they 
differ by just a single carbon 
in their ring frameworks, it 
would be useful to interconvert 
them during structure-activity 
relationship studies. However, 
aromatic stabilization in these 
compounds makes that process 
difficult. Woo et al. report that 
narrow-wavelength irradia- 
tion of quinoline N-oxides in 
the near-ultraviolet range 
followed by acid treatment 
cleanly excises a ring carbon 
to produce an indole, avoiding 
secondary photoproducts that 
were previously observed using 
broadband light. —JSY 

Science, abo4282, this issue p. 527 
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Embracing the nightlife 


Some animals expertly adapt to urban 
environments, often by changing 
their behavior. Gallo et al. investigated 
whether mammals in cities change the 
timing of their activities to earlier or 
later in the day. Using images from camera 
traps in 10 US cities, the authors analyzed 
how mammal activity changed across 
gradients in human population density, 
impervious surface cover, temperature, and 
vegetation. Coyotes were more nocturnal in 
more populated areas. Common omnivore 
and herbivore species changed their activity 
with urbanization, although some became 
more nocturnal and some less. Coyotes 
and red foxes avoided hot temperatures by 
increasing activity at dusk. Such temporal 
shifts can allow animals to exploit resources, 
avoid conflict, and navigate urban environ- 
ments. —BEL eLife 11, e74756 (2022). 


Urbanized areas attract certain wild species, 
many of which have changed their behavior 
to exploit new opportunities, like this raccoon 


foraging in a garbage bin at night. 


Loading the dice for 
active learning 


The benefits of active learning 
have been well established, but 
traditional lecture-orientated 
approaches are still the norm 

at many institutions. Yik et al. 
implemented large-scale data 
collection (from 2382 instructors) 
to empirically test the relative 
association of 17 malleable fac- 
tors that influence the uptake of 
active learning. Using regression 
analysis and multilevel modeling, 
the team found that contextual 
factors (e.g., classroom setup) 
and personal contexts (e.g., 
participation in scholarship of 
teaching and learning activities) 
were associated with a decrease 
in the percentage of time spent 
lecturing. Broad recommen- 
dations for sustaining active 
learning strategies are discussed, 
including constructing classroom 
spaces that support active learn- 
ing, incentives for professional 


development, and meaning- 
ful recognition of scholarship 
of teaching and learning work 
accomplished by instructors. 
—MMc 

Int. J. STEM Educ. 9, 15 (2022). 


Photocontrol fluttering 
controls polymerization 


Most methods for pausing 

and resuming polymerization 
reactions act at the growing 
chain. Aoki et al. show how light- 
induced conformer changes of a 
monomer switch the kinetics of a 
supramolecular polymerization. 
A cyclooctatetraene derivative, 
formed from fused thiophene 
molecules that bear hydrogen- 
bonding side chains, has a chiral 
nonplanar central ring that can 
racemize through a fluttering 
motion. This molecule polymer- 
izes in the dark as alternating R 
and S monomers but not during 
prolonged ultraviolet irradiation. 
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In the dark, the fluttering motion 
is much slower than chain 
elongation, but under irradiation, 
fluttering is 11 times faster. This 
motion inhibits both nucleation 
and polymerization until the solu- 
tion is returned to the dark state 
and provides a photoswitch for 
polymerization. —PDS 

J. Am. Chem. Soc. 10.1021/jacs.2c02176 (2022). 


Mind the genome gaps 
The Cancer Genome Atlas 
(TCGA) is a major resource in 
cancer genomics, encompass- 
ing sequencing data for 33 
different human cancer types. 
Unfortunately, ethnic minority 
patients are underrepresented 
in TCGA, even though they 
often have a higher cancer 
risk. Wickland et al. examined 
TCGA data for self-reported 
Black patients with seven tumor 
types and discovered that for 
six of these cancers, the tumor 
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genomes of these patients were 
sequenced at a lower depth than 
tumor genomes of those with 
European ancestry. In some 
cases, this happened because the 
genomes of most patients with 
African ancestry were sequenced 
in later batches. Additionally, they 
were more likely to have insuf- 
ficient coverage in parts of the 
exome, likely because the human 
reference genome is biased 
toward European ancestry. These 
findings highlight important gaps 
to address in future sequencing 
efforts. —YN 

J. Natl. Cancer Inst. 10.1093/jnci/djac054 (2022). 


Unwrapping TDP-43 
Oligodendrocytes continually 
renew and mature to generate 
myelin sheaths that enwrap 
and insulate axons. TDP-43 is 
a protein known to play a role 
in neurodegenerative disease. 
Heo et al. studied TDP-43 loss 


of function in the mouse brain. 
Selective deletion of TDP-43 
from oligodendrocyte precursor 
cells resulted in their rapid death. 
Deletion of TDP-43 at different 
stages of oligodendrocyte matu- 
ration led to different outcomes. 
Early deletion caused degenera- 
tion of mature oligodendrocytes, 
seizures, and premature lethality, 
whereas late deletion spared 
oligodendrocytes and allowed 
seizure-free survival. Both 
manipulations led to thinner and 
fewer myelin sheaths and to inap- 
propriate wrapping of neuronal 
cell bodies and blood vessels. 
Conditional removal of TDP-43 
from oligodendrocytes in adults 
caused hindlimb weakness. Loss 
of TDP-43 led to the missplicing 
of RNAs that encode key genes 
involved in oligodendrocyte mor- 
phogenesis and myelination and 
may underlie abnormal neuronal 
activity and neurodegeneration 
in diverse neurological diseases. 
—SMH 

eLife 11, e75230 (2022). 
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Proton transfer inside 
an optical cavity 


There is considerable interest in 
developing approaches to control- 
ling proton transfer reactions, 
which play a key role in a wide 
range of practically important 
energy conversion processes. 
Using quantum electrodynamics 
ab initio methods, Pavošević 
et al. studied the effect of strong 
light-matter interactions, which 
can be created by an optical cavity, 
on intramolecular proton transfer 
in malonaldehyde and aminopro- 
penal. The authors showed that 
the optical cavity can increase 
or decrease the rate of proton 
transfer in these two prototypical 
systems, depending on the cavity- 
mode polarization. The present 
work demonstrates that the effect 
of an optical cavity could serve as 
a practical route to controlling pro- 
ton transfer reactions and should 
trigger further experimental devel- 
opments in this direction. —YS 
J. Am. Chem. Soc. 144, 4995 (2022). 


Don’t hold your breath! 


Breathing does more than just 
ensure the oxygenation of an 
animal's tissues. With its dynamic 
and regular mechanical forces, 
breathing has been implicated in 
all sorts of processes, including 
lung development and regulation 
of immune responses. Bai et al. 
used a human lung alveolus chip 
microfluidic device that recapitulates 
human airway physiology to 
model H3N2 influenza infection. 
They found that cyclical respira- 
tory motion inhibits viral infection 
by activating the mechanosensi- 
tive ion channel TRPV4, which 
then induces lung epithelium and 
endothelium to produce S100 
alarmin proteins. These media- 
tors bind the pattern recognition 
receptor RAGE, which triggers 
several innate immune response 
programs that help combat virus. 
Intriguingly, a TRPV4 inhibitor was 
able to reduce both inflammation 
and viral loads, suggesting that 
this may be a useful approach for 
the treatment of viral pneumonia. 
—STS 


Nat. Commun. 13, 1928 (2022). 
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PAIN 
Trk’ing mutations 
for pain relief 


The nerve growth factor receptor 
tropomyosin receptor kinase A 
(TrkA) is an attractive target for 
alleviating pain, but it also medi- 
ates other signaling pathways 
that are critical to neuronal func- 
tion and survival. To find targets 
specifically within its pain- 
mediating pathways, Moraes 
et al. examined the structural 
consequences of TrkA mutations 
that cause a pain-insensitivity 
disorder. They found mutations 
that disrupted the interaction of 
TrkA with one of its downstream 
effectors, phospholipase Cγ. 
Treating mice with a peptide 
designed to similarly disrupt the 
interaction reduced sensitivity to 
painful inflammation. —LKF 

Sci. Signal. 15, eabm6046 (2022). 


DOG GENOMICS 


What is your dog like? 
Modern domestic dog breeds 
are only ~160 years old and 
are the result of selection for 
specific cosmetic traits. To inves- 
tigate how genetics align with 
breed characteristics, Morrill et 
al. sequenced the DNA of more 
than 2000 purebred and mixed- 
breed dogs. These data, coupled 
with owner surveys, were used 
to map genes associated with 
behavioral and physical traits. 
Although many physical traits 
were associated with breeds, 
behavior was much more vari- 
able among individual dogs. In 
general, physical trait heritability 
was a greater predictor of breed 
but was not necessarily a predic- 
tor of breed ancestry in mutts. 
Among behavioral traits, bidda- 
bility—how well dogs respond to 
human direction—was the most 
heritable by breed but varied 
significantly among individual 
dogs. Thus, dog breed is gener- 
ally a poor predictor of individual 
behavior and should not be used 
to inform decisions relating to 
selection of a pet dog. —LMZ 
Science, abk06339, this issue p. 475 
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INFECTIOUS DISEASE 
Another pandemic 


threat? 


Over the past year, a highly 
pathogenic avian influenza virus 
(HPAIv) lineage has caused 
widespread outbreaks and 
deaths in poultry and wild birds, 
as well as some infections in 
humans. Although there is no 
evidence of human-to-human 
transmission, there is concern 
that the prevalence of these 
viruses could result in adapta- 
tion that allows transmission 
between humans. In addition, 
the unprecedented spread and 
frequency in Europe, Asia, and 
North America has had major 
impacts on the poultry industry 
and is a threat to wild birds, 
especially those that are already 
endangered. In a Perspective, 
Wille and Barr discuss the fac- 
tors that have led to outbreaks of 
this HPAIv lineage, the conse- 
quences, and what can be done. 
—GKA 

Science, abo1232, this issue p. 459 


CANCER 
Building up to melanoma 


Numerous cancer-causing muta- 
tions have been identified over 
the years, but they do not occur 
in isolation, and it is difficult 
to disentangle the effects of 
individual genetic changes in a 
tumor that contains hundreds 
of mutations. To gain insight 
into melanoma biology and 
facilitate further study of this 
cancer, Hodis et al. began with 
healthy human melanocytes and 
sequentially introduced mel- 
anoma-associated mutations 
using gene editing. The edited 
cells were then grown in mouse 
models, simulating human mela- 
nomas with different mutation 
patterns. The authors examined 
the genomics, histology, and 
biological behavior of the result- 
ing tumors and compared them 
with naturally occurring human 
tumors, demonstrating the rel- 
evance of their approach. —YN 
Science, abi8175, this issue p. 474 
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PATHOGEN EVOLUTION 
Immunity-induced 
evolution 


Population immunity is thought 
to drive pathogen evolution- 
ary dynamics, but this process 
can be hard to measure. 
Considering the switch from 
whole-cell vaccines to acellular 
vaccines (ACVs) as a natural 
experiment, Lefrancq et al. ana- 
lyzed the diversity of Bordetella 
pertussis genome isolates 
from five continents to infer 
local changes in strain fitness 
after ACV introduction. Their 
analysis provides evidence that 
ACVs have led to the increased 
fitness of strains that lack 
the virulence factor pertactin 
and may have implications for 
understanding asymptomatic 
B. pertussis infection. —CAC 
Sci. Transl. Med. 14, eabn3253 
(2022). 


BATTERIES 
Observations 
of cathode evolution 


The loss of capacity in a 
rechargeable battery can be 
due to changes in the elec- 
trode structure that occur with 
cycling. Li et al. used hard x-ray 
holotomography to visualize 
the structure of a nickel-rich 
LiNi₀.₈Mn₀.₁Co₀.₁O₂ composite 
cathode (see the Perspective 
by Xiao). They were able to 
track the behavior of thousands 
of individual particles with 
time and thus determine the 
relationship between structure 
and performance as well as the 
deterioration of the cathode at 
a size scale that is not gener- 
ally accessible. They found 
that damage during cycling is 
driven not only by each particle 
but also by its surrounding 
neighbors, although the con- 
tributions shift over time. This 
work suggests ways to better 
design electrodes to maximize 
their performance. —MSL 
Science, abm8962, this issue p. 517; 
see also abo76/0, p.455 


ZEOLITES 
Benzene stretches 
zeolite channels 


The pores of zeolites allow 
for selective adsorption of 
molecules based on their size 
and shape. The pore sizes that 
are calculated based on their 
structure are smaller than what 
would be expected from their 
absorption and chemical reac- 
tion properties, which implies 
that the frameworks must have 
inherent flexibility. Xiong et al. 
used environmental transmis- 
sion electron microscopy to 
image the straight channels of 
ZSM-5 zeolite with adsorbed 
benzene (see the Perspective 
by Willhammar and Zou). The 
pores stretched along the lon- 
gest direction of the confined 
benzene molecules by up to 
15%. This large change was 
compensated for by adjacent 
channels so that the overall 
deformation of the original unit 
cells was less than 0.5%. —PDS 
Science, abn766/, this issue p. 491; 
see also abo5434, p. 457. 


EXTINCTION 

Rising temperatures, 
rising risks 

Climate change brings with it 
the increasing risk of extinction 
across species and systems. 
Marine species face particular 
risks related to water warming 
and oxygen depletion. Penn and 
Deutsch looked at extinction 
risk for marine species across 
climate warming and as related 
to ecophysiological limits (see 
the Perspective by Pinsky and 
Fredston). They found that 
under business-as-usual global 
temperature increases, marine 
systems are likely to experience 
mass extinctions on par with 
past great extinctions based 

on ecophysiological limits 
alone. Drastically reducing 
global emissions, however, 
offers substantial protection, 
which emphasizes a need for 
rapid action to prevent possibly 
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catastrophic marine extinctions. 

—SNV 
Science, abe90339, this issue p. 524; 
see also abo4259, p. 452 


NEUROETHOLOGY 
Bees want, too 


In mammals, the subconscious 
“wanting” of something, such as 
food, is facilitated by a complex 
neurobiological process that 
involves dopaminergic signaling. 
In a eusocial animal, such as the 
honey bee, individuals supply 
food not just for themselves but 
also for their colony. Huang et 
al. looked at the neurobiological 
basis of wanting in honey bees 
(Apis mellifera) and found 
that it is also controlled by a 
dopamine-based signaling 
process, suggesting a shared 
mechanism across millions 
of years of divergence (see 
the Perspective by Garcia and 
Dyer). Furthermore, they found 
that wanting was stimulated in 
bees based on their own indi- 
vidual desire to forage as well as 
observation of the bee waggle 
dance, suggesting the existence 
of a colony-level motivation 
mechanism. —SNV 

Science, abn9920, this issue p. 508; 

see also abp8609, p. 456 


EPIDEMIOLOGY 
The madness of dogs 


Rabies is a deadly zoonotic 
disease that causes tens of 
thousands of deaths every year, 
mainly among African and Asian 
children. It is usually transmit- 
ted to humans by dogs, and 
once symptoms appear, it is 
inevitably fatal. Despite control 
efforts, rabies continues to 
circulate at very low prevalence 
in Serengeti district, Tanzania, 
where Mancy et al. have been 
tracking cases in several spe- 
cies for more than a decade to 
understand the dynamics of the 
virus. Highly resolved genetic 
data allowed the authors to trace 
transmission networks, from 
which individual dog behavior 
emerges as a key factor (see 
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the Perspective by Antolin). 
Some infected dogs may travel 
long distances and introduce 
new lineages into neighboring 
communities, whereas others 
may simply bite other animals. 
Because dogs are mobile, culling 
fails, and the only resort is fully 
comprehensive dog vaccina- 
tion. —CA 

Science, abn0713, this issue p. 512; 

see also abo7428, p. 453 


ORGANIC CHEMISTRY 
Additive improvements 
to nickel catalysis 


It often takes decades of incre- 
mental optimization to apply 
chemical reactions beyond 
the small range of substrates 
studied at the discovery stage. 
Prieto Kullmer et al. sought to 
accelerate that optimization 
process by screening a large, 
diverse group of additives to a 
cooperative nickel-photoredox 
catalyst system. The screen 
revealed that phthalimides 
substantially expand the func- 
tional compatibility of the nickel 
catalyst and thus the substrate 
scope. The phthalimide appears 
to stabilize oxidative addition 
complexes as well as break up 
deactivated catalyst aggregates. 
—JSY 

Science, abn1885, this issue p. 532 
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INTRODUCTION: Cancer develops from cells that 
become malignant because of mutations in 
multiple genes—often accumulated over long 
time periods—that produce a phenotypic diver- 
sity across patient tumors. Specific genetic al- 
terations in particular cancer types have been 
linked to prognosis, to response or resistance 
to therapies (especially targeted therapeutics), 
and to a tumor’s propensity to acquire further 
mutations, among other phenotypes. However, 
genotype-phenotype connections are challeng- 
ing to infer in patients, as any two tumors differ 
genetically in too many ways to isolate the 
effect of one or several mutations. The ability 
to systematically connect cancer-associated 
mutations or combinations thereof with their 
phenotypic consequences would advance our 


understanding of the mechanisms of cancer 
pathogenesis and genetically linked disease 
features. 


Cancer genotype engineering 
Building human melanoma models from melanocytes 


RATIONALE: We reasoned that genome editing 
and the fitness advantage of cancer-associated 
mutations could be leveraged to generate hu- 
man cellular models of tumor development. 
Such genome-edited models would replicate 
the precise genetics, lineage relationships, and 
stepwise progression of cancer and allow us 
to establish genotype-to-phenotype links in a 
controlled experimental design. While similar 
models have been realized for tumors origi- 
nating from self-renewing stem cells as cells of 
origin, specifically in colorectal cancer, no com- 
parable models exist for tumor types that arise 
Genome-engineered human cell models connect melanoma genotypes to phenotypes. Sequential, 
precise gene editing of human melanocytes produced a series of melanoma models. The editing approach we 
developed—without selection markers or single-cell cloning—is applicable to other cell types, expanding 
the toolkit of available cancer models. Through in-depth phenotypic characterization of in vitro cells and 
in vivo tumors from xenografts in immunodeficient mice, multimutant melanoma genotypes were causally 
linked to specific phenotypes. 


474 29 APRIL 2022 « VOL 376 ISSUE 6592 


from nonstem differentiated cells. We present 
an approach that starts from the nonstem cell 
of origin of melanoma, the healthy human 
melanocyte; this approach then generates a 
series of cells with precise genome editing of 
mutations in key cancer genes, thus expand- 
ing the horizon of possible cellular models of 
cancer development. 


RESULTS: We generated a progressive series 
of genome-edited human models of melanoma. 
We started from healthy human melanocytes 
and introduced, in a stepwise fashion, muta- 
tions in up to five genes spanning six pathways 
commonly dysregulated in melanoma: CDKN2A 
(part of the RB pathway), BRAF (MAPK), TERT 
(telomerase), PTEN (PI3K/AKT), TP53 (p53), 
and APC (Wnt), for a total of nine genetically 
distinct cellular models. We characterized these 
models during growth in vitro and after intra- 
dermal injection through mouse xenografts, 
using physiological assessment, histopathol- 
ogy, and single-cell RNA sequencing (scRNA- 
Seq), leveraging computational methods and 
machine learning algorithms. Through these 
models, we connected melanocyte genotypes 
to phenotypes such as gene expression programs, 
replicative immortality, malignancy, rapid tu- 
mor growth, tumor pigmentation, metastasis, 
and histopathological features. In vitro, con- 
secutive mutations produced an ordered pro- 
gression through expression space. In vivo, 
mutations in malignant cells also affected the 
cell-type composition and expression states of 
tumor-infiltrating microenvironment cells. Our 
melanoma models shared genotype-associated 
expression programs with patient melanomas 
and partially recapitulated patient melanoma 
genotype-associated histopathological features. 


CONCLUSION: The genotype-phenotype con- 
nections we identify highlight how the impact 
of mutations often depends on genetic context. 
Such genetic epistasis makes understanding 
the phenotypic consequences of the mutational 
landscape of human cancers a combinatorial 
problem whose study requires modeling strat- 
egies that can scale to multiple mutations, such 
as the one presented in this study. Genome- 
edited human models of cancer enable the 
identification of causal relationships be- 
tween defined sets of genetic alterations 
and disease-relevant phenotypes, further- 
ing understanding of how cancer mutations 
help give rise to the diverse and varied pheno- 
types of human malignancy. 
Stepwise-edited, human melanoma models reveal 
mutations’ effect on tumor and microenvironment 


Eran Hodis, Elena Torlai Triglia, John Y. H. Kwon, Tommaso Biancalani, 
Labib R. Zakka, Saurabh Parkar, Jan-Christian Hütter, Lorenzo Buffoni, Toni M. Delorey, 
Devan Phillips, Danielle Dionne, Lan T. Nguyen, Denis Schapiro, Zoltan Maliga, 
Establishing causal relationships between genetic alterations of human cancers and specific phenotypes of 
malignancy remains a challenge. We sequentially introduced mutations into healthy human melanocytes 

in up to five genes spanning six commonly disrupted melanoma pathways, forming nine genetically distinct 
cellular models of melanoma. We connected mutant melanocyte genotypes to malignant cell expression 
programs in vitro and in vivo, replicative immortality, malignancy, rapid tumor growth, pigmentation, 
metastasis, and histopathology. Mutations in malignant cells also affected tumor microenvironment 
composition and cell states. Our melanoma models shared genotype-associated expression programs 
with patient melanomas, and a deep learning model showed that these models partially recapitulated 
genotype-associated histopathological features as well. Thus, a progressive series of genome-edited human 
cancer models can causally connect genotypes carrying multiple mutations to phenotype. 


Healthy human cells become cancerous 
through the acquisition of genetic muta- 
tions. Efforts to sequence the genomes of 
human cancer cells have illuminated the 
daunting array of mutation combina- 
tions that can cause life-threatening malig- 
nancies, even when they arise from the same 
cell type of origin (1, 2). Additionally, a great 
phenotypic diversity, both within and between 
patients, is caused in part by the somatic mu- 
tations observed in these intricate genetic maps 
of cancer (1, 2). For example, certain genetic 
differences may explain why some cancer cells 
are more prone to metastasize (3, 4), some are 
less susceptible to immune attack (5), and others 
have genomes that are more likely to accumulate 
chromosomal alterations (6). Thus, linking 
maps of cancer mutations to disease-relevant 
phenotypic consequences advances our under- 
standing of cancer biology and may inform 
the design of genetically targeted therapies. 
However, genotype-to-phenotype connec- 
tions are not easily revealed by comparing 
individual human cancers, as any two patient 
tumors or cell lines typically differ genetically 
in too many ways to distinguish the effect of a 
single mutation or a particular combination of 
mutations (7-9). Furthermore, early stages in 
cancer development are rarely represented in 
patient-derived tumors and cell lines (10, 11). 
One solution—made possible by recent ad- 
vances in mammalian genome editing (12, 13)— 
is to use human cell models to replicate, in an 
isogenic fashion, precise multimutant genet- 
ics, cell lineages, and stepwise progression of 
cancer. Such human models have been real- 
ized for colorectal cancer through the use of 
stem cells and timed withdrawal or addition 
of mutation-matched growth factors or chem- 
icals (14, 15). However, there remains a need 
for an approach that does not depend on 
foreknowledge of selective pressures nor on 
single-cell cloning, thus enabling generalized 
multimutation modeling of nonstem cells. 
Melanoma provides a prime case in point 
for multimutation cancer modeling. Its ge- 
netic landscape is both well-charted and com- 
plex as a result of sunlight-induced DNA 
damage (8, 16-26). Despite their complexity, 
nearly all melanomas arising in hair-bearing 
skin have genetic alterations in the retino- 
blastoma tumor suppressor (RB) pathway, the 
mitogen-activated protein kinase (MAPK) pathway, 
and in telomerase regulation (20, 27). 
These three molecular pathways are most com- 
monly dysregulated by inactivating mutations 
or loss of CDKN2A, an activating mutation in 
BRAF, and one of two point mutations in the 
TERT promoter, respectively. Melanoma pro- 
gression is further associated with mutations 
in many different pathways, including the 
phosphatidylinositol-3-kinase (PI3K)/Akt path- 
way (mutated in ~25% of thick melanomas), 
the p53 pathway (~25% of thick melanomas), 
and the Wnt pathway (APC alterations in ~2 to 
7% of melanomas and CTNNB1 mutations in 
~5% of melanomas) (16, 27). Moreover, the mela- 
noma cell of origin—the melanocyte, a pigment- 
producing skin cell—is known and readily 
grown in culture. Additionally, primary human 
melanocytes are amenable to genome editing, 
and a single melanoma-associated mutation 
can undergo positive selection in standard 
melanocyte culture conditions (28). Recently, 
pioneering work has taken the first steps to- 
ward genome-engineered melanomas by intro- 
ducing inactivating CDKN2A and activating 
[Val600→Glu (V600E)] BRAF mutations into 
human melanocytes (28). Taken together, these 
features make melanoma a compelling case 
study for the development of multimutation 
cell models. We took an engineering approach 
that leverages advances in genome editing 
and the fitness benefit conferred by cancer- 
associated mutations to generate a collection 
of multimutation primary cell models, which 
we characterized molecularly and phenotypically. 


Mutation fitness advantage enables multistep 
genome editing 


We developed a strategy to sequentially intro- 
duce different series of cancer-associated muta- 
tions into healthy, differentiated human cells in 
culture (Fig. 1A). Exploiting the fitness advan- 
tage of cancer-associated mutations (29), we 
repeatedly selected for desired mutations at 
the endogenous gene loci by waiting for the 
faster-growing mutant cell population to out- 
compete nonmutant cells over time in culture, 
without selection markers or single-cell clon- 
ing. To introduce each precise mutation, we 
delivered the necessary genome editing ma- 
chinery to cells in vitro by electroporation of 
Cas9 ribonucleoprotein (RNP) complex and 
then monitored the mutant allele frequency 
in the cell population over time under standard 
cell culture conditions (30). Once the mutant 
alleles surpassed the nonmutant alleles in 
frequency—indicating that the mutant cells 
had outcompeted the nonmutant cells and 
become the predominant population—we in- 
troduced the next mutation. We repeated the 
entire process multiple times to sequentially 
introduce up to five cancer-associated muta- 
tions per cell model. 
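
The selection step described above can be illustrated with a simple two-population growth model. The sketch below is an illustration only, not the authors' analysis: it assumes exponential growth of both populations, a constant net growth-rate advantage for mutant cells, and treats the mutant cell fraction as a stand-in for mutant allele frequency. The function names and the example values (a 6% starting fraction and a 0.05 per day advantage) are hypothetical and are not fitted to the data in Fig. 1.

```python
import math


def mutant_fraction(f0: float, advantage_per_day: float, days: float) -> float:
    """Mutant cell fraction after `days` of competitive growth.

    f0 is the starting mutant fraction (0 < f0 < 1); advantage_per_day is the
    assumed difference in exponential growth rate between mutant and
    nonmutant cells.
    """
    # The mutant:nonmutant odds grow exponentially at the rate difference.
    odds = f0 / (1.0 - f0) * math.exp(advantage_per_day * days)
    return odds / (1.0 + odds)


def days_to_majority(f0: float, advantage_per_day: float) -> float:
    """Days until mutant cells make up half of the population."""
    if f0 >= 0.5:
        return 0.0
    return math.log((1.0 - f0) / f0) / advantage_per_day


if __name__ == "__main__":
    # Hypothetical values chosen only to show the shape of the curve.
    t_half = days_to_majority(0.06, 0.05)
    print(f"majority reached after about {t_half:.0f} days")
    print(f"fraction at day 155: {mutant_fraction(0.06, 0.05, 155):.3f}")
```

Under these assumptions the time to majority depends only on the starting odds and the growth-rate difference, which is why a mutation with a small fitness benefit can still take over a culture given enough time.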

Fig. 1. Fitness advantage of cancer-driving mutations enables the 
creation of a progressive series of genome-edited human cancer models. 
(A) Experimental approach for introducing sequential melanoma mutations 
into the genomes of primary human melanocytes with CRISPR-Cas9. RNP, 
ribonucleoprotein; AAV, adeno-associated virus. (B) Editing tree showing the 
nine isogenic models of melanoma generated (boxes), the perturbed genes in 
each model (white boxes), the genotype abbreviation (beige boxes), and the 
molecular pathway dysregulated by the most recent genome edit (red text). 
(C to E) Sequential introduction of first three mutations by CRISPR-Cas9 
genome editing of wild-type (WT) melanocytes. (C) First mutation: CDKN2A 
(“C”). (D) Second mutation: BRAF (“B”). (E) Third mutation: TERT (“T”). 
TERT editing confers replicative immortality to CB melanocytes. Allele 
frequencies of each engineered mutation (y axis) shown over time (x axis). 
#, measurement of allele frequency discontinued because of cell senescence. 
(F) Addition of the -124C>T TERT promoter mutation activates TERT expression. 
Mean of log10 number of TERT and β-actin (ACTB) transcripts (y axis) 
measured by qPCR in CB (black) and CBT (red) cells. Error bars, SD. n = 3. 
*P < 0.01, one-tailed, one-sample Student's t test. (G to I) Introduction 
of fourth mutation into CBT melanocytes. (G) Allele frequencies of knockout 
of PTEN (“P”), (H) knockout of TP53 (“3”), and (I) knockout of APC (“A”). 
(J and K) Introduction of fifth mutations into CBTP melanocytes. (J) Allele 
frequency of knockout of PTEN and (K) knockout of TP53. Allele frequencies 
(y axis) shown over time (x axis), as assessed by indels in the respective 
loci in genomic DNA. 


We created a melanocyte genome-editing 
tree guided by both human melanoma genetics 
and technical complexity (Fig. 1B), by sequen- 
tially introducing mutations into the RB 
pathway, then the MAPK pathway, and then 
the telomerase regulation pathway through 
precise editing of healthy human melanocytes. 
We knocked out CDKN2A (“C”, RB path- 
way) by electroporating a genome-editing Cas9 
RNP targeting the CDKN2A locus (exon 2, 
shared by both of its protein products, p16 
and p14) (30). Small insertions and deletions 
(indels) in the gene underwent positive selec- 
tion in culture, reaching 90 to 95% mutated 
allele frequency by day 3 and 98 to 99% by 
day 42 [Fig. 1C; mutated allele frequency 
quantified as percent of alleles with an indel; 
the two predominant alleles were 79- and 
53-base pair (bp) deletions between the guides 
of pairs 1 and 2 at 88 and 75%, respectively]. 
We next introduced the BRAF V600E mutation 
(“B”, MAPK pathway) into C-edited melano- 
cytes by codelivering Cas9 RNP targeting BRAF 
exon 15 and a homologous DNA donor encod- 
ing the V600E mutation (30). Recombinant 
adeno-associated virus (AAV) was used to 
deliver the DNA donor to overcome the low 
editing efficiency of the single-stranded oligo- 
deoxynucleotide donors (<0.25% at day 6; 
table S1) (31). The BRAF V600E allele fre- 
quency increased from 6% at day 3 to 97% 
at day 155 in culture (Fig. 1D, reflecting a 
homozygous population and likely indicating 
that, in this context, two V600E alleles 
provide a greater fitness advantage than one 
allele). Finally, we introduced the TERT -124C>T 
promoter mutation (“T”, telomerase regulation) 
into “CB” melanocytes, codelivering a Cas9 RNP 
targeting TERT exon 1 and a homologous DNA 
donor encoding TERT -124C>T. The frequency 
of the -124C>T TERT mutated allele increased 
from 3 to 5% in the first 30 days in culture to 
45% by day 75 and stayed at 41 to 50% for 
more than 300 days of continuous culture (Fig. 
1E; ascertained as predominantly heterozygous 
with a small subpopulation of homozygous 
cells) (30). Although the more common mutation 
trajectory in human melanoma may be B → 
T → C (18, 27), we opted to engineer the path 
C → B → T to defer the more difficult, precise 
genome edits to a stage when the cells would 
be more cancerlike and therefore more ame- 
nable to editing (32) (the B → T → C order 
was not attempted, though previous work has 
demonstrated the feasibility of engineering B 
as the first mutation) (28). Engineering the 
TERT promoter mutation was the most tech- 
nically difficult of the three mutations; it re- 
quired testing 40 different Cas9 guide sequences 
to identify a potent reagent for making double- 
stranded breaks near the TERT promoter locus 
(table S2). This difficulty was possibly due 
to the high G:C content or closed chromatin 
state at this locus (33). We termed the result- 
ing triple-mutation cells CBT melanocytes. 

This first progressive series of mutant melano- 
cyte models addressed whether TERT promoter 
mutations turn on TERT expression and confer 
replicative immortality in the appropriate 
genetic and cellular context. Indeed, the CBT 
melanocytes showed TERT expression by quantitative 
polymerase chain reaction (qPCR), whereas 
the CB melanocytes exhibited none (Fig. 1F). 
Furthermore, the CBT melanocytes grew in- 
definitely in culture (>1.5 years), whereas the 
CB cells exhibited morphological signs of se- 
nescence (“fried egg” appearance; fig. S1) and 
stopped dividing by day 100 (Fig. 1E, black 
curve and hash marks), by which point the cells 
had been in continuous culture for ~6 months 
since the original thaw of the wild-type (WT) 
parental melanocytes. We observed compara- 
ble effects on TERT expression and replica- 
tive immortality with the other common TERT 
promoter mutation, -146C>T (21, 22) (fig. S2). 
Thus, either -124C>T or -146C>T TERT pro- 
moter mutation is sufficient to activate TERT 
expression and confer replicative immortal- 
ity upon human melanocytes in the CB ge- 
netic context. 

Because melanoma progression is associated 
with mutations in many different pathways— 
including the PI3K/Akt, p53, and Wnt pathways 
(16, 27, 34)—we explored the effect of subse- 
quent loss-of-function mutations in PTEN (“P”), 
TP53 (“3”), or APC (“A”), respectively (Fig. 1B). 
Indels in each of the fourth targeted genes 
underwent positive selection in culture, reach- 
ing 94 to 99% mutant allele frequency by 
70 days at most, separately yielding CBTP, 
CBT3, and CBTA melanocytes (Fig. 1, G to I). 

Finally, to explore combinations of mela- 
noma progression mutations, we extended the 
P branch of the editing tree by introducing 
either the 3 or A mutation (Fig. 1B). In pro- 
ducing CBTP3 melanocytes, indels in TP53 
increased over time in culture to an allele 
frequency of 96 to 98% (Fig. 1J). In producing 
CBTPA melanocytes, indels in APC stayed at a 
stable allele fraction of ~75 to 85% (Fig. 1K) 
only to later increase to >99% when grown 


Hodis et al., Science 376, eabi8175 (2022) 


in vivo in xenograft studies (six of six exam- 
ined tumors, see below). Upon introduction of 
each mutation throughout the editing tree, we 
observed the expected functional effect on the 
relevant molecular pathway, confirming the 
activity of the genetic mutations (fig. S3). 

Our genome-edited tree of human melano- 
cytes demonstrates that the fitness advantage 
of cancer-driver mutations can be leveraged 
to generate progressive multimutant mod- 
els from primary, differentiated human cells. 
Overall, we generated melanocytes with up 
to five precise mutations in key melanoma 
pathways, opening the way to investigation 
of genotype-to-phenotype relationships during 
cancer development. 


Consecutive mutations produce ordered 
progression through expression space in vitro 


We related mutations to their expression con- 
sequences by profiling cells of each genotype 
in the editing tree with single-cell RNA se- 
quencing (scRNA-Seq; Fig. 2A) (30). We found 
that as cells harbored increasing numbers of 
mutations, they continuously progressed in 
expression space. We profiled cells in multi- 
plex by labeling cells of each genotype with a 
distinct, DNA-barcoded, cell-surface-protein 
antibody (cell hashing) (35), followed by pool- 
ing of all genotypes to assay all the cells in 
one batch. We retained 11,042 high-quality cell 
profiles, with a median of 999 cells per genotype 
(range: 836 to 2360) (30). Genotype-agnostic, 
unsupervised embedding of the profiles into 
a two-dimensional (2D) space with uniform 
manifold approximation and projection (UMAP) 
followed the topology of the melanocyte edit- 
ing tree (Fig. 2B). WT cell profiles were em- 
bedded next to C cell profiles, which were in 
turn adjacent to CB cells—all forming one con- 
tinuum, with partial overlap between geno- 
types (Fig. 2B). The CBT cells, although still 
adjacent to the CB cells, were connected 
through a narrow transition and were pri- 
marily embedded in a separate cluster of cells 
that included only genotypes with replicative 
immortality. Although the CBT3 and CBTP 
cells were located on either side of the CBT 
cells, the CBTA melanocytes largely diverged 
into their own isolated cluster. Finally, CBTP3 
and CBTPA cells mapped near their parental 
CBTP cells (in an overlapping manner), rather 
than close to the CBT3 or CBTA cells with 
which they share the 3 and A mutations. These 
results suggest that as melanocytes acquire 
sequential cancer-associated mutations, they 
follow an ordered progression through expres- 
sion space. 

The progression of mutant melanocytes 
through expression space coincided with mod- 
ulation of expression programs associated with 
distinct biological processes. We decomposed 
the expression profiles of all single cells jointly 
into expression programs—learned de novo— 
through consensus nonnegative matrix factor- 
ization (cNMF) (Fig. 2C and figs. S4 and S5) 
(30, 36). The seven programs were used by cells 
across multiple genotypes, capturing both 
the continuity of the transitions and shared 
features between distant genotypes (Fig. 2C). 
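
As a rough illustration of the factorization step, the sketch below implements plain nonnegative matrix factorization with multiplicative updates on a toy cells-by-genes matrix. It is an assumption-laden sketch, not the authors' pipeline: consensus NMF additionally repeats the factorization over many random seeds and clusters the resulting programs, which is omitted here, and all function names and the toy data are hypothetical.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def transpose(m):
    """Transpose a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def frobenius_error(v, w, h):
    """Squared reconstruction error ||V - WH||^2."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))


def nmf(v, k, n_iter=200, seed=0, eps=1e-9):
    """Factor nonnegative V (cells x genes) into usages W and programs H."""
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + eps for _ in range(k)] for _ in range(n)]
    h = [[rng.random() + eps for _ in range(m)] for _ in range(k)]
    for _ in range(n_iter):
        # Update programs: H <- H * (W^T V) / (W^T W H)
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[a][j] * num[a][j] / (den[a][j] + eps) for j in range(m)]
             for a in range(k)]
        # Update usages: W <- W * (V H^T) / (W H H^T)
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][a] * num[i][a] / (den[i][a] + eps) for a in range(k)]
             for i in range(n)]
    return w, h


def relative_usage(w):
    """Normalize each cell's program usages so that they sum to 1."""
    return [[x / (sum(row) or 1.0) for x in row] for row in w]


if __name__ == "__main__":
    # Toy expression matrix built from two hypothetical programs.
    toy = [[1.0, 2.0, 0.0, 0.0], [2.0, 4.0, 0.0, 0.0], [0.0, 0.0, 3.0, 1.0],
           [0.0, 0.0, 9.0, 3.0], [1.0, 2.0, 3.0, 1.0]]
    usages, programs = nmf(toy, k=2)
    print(relative_usage(usages))
```

Each row of the normalized usage matrix corresponds to the per-cell relative usage of the learned programs, the quantity shown in Fig. 2C.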
We annotated each program by its top asso- 
ciated genes through manual review and gene 
set enrichment as “melanocyte,” “interferon/ 
p53,” epithelial-mesenchymal transition (“EMT”), 
“Myc/mTORC1,” “Myc/mTORC1/Ox-Phos,” and 
cell-cycle-related “S Phase” and “G2/mitosis” 
(Fig. 2D and tables S3 and S4) (30). The melanocyte 
program was associated with melanocyte 
lineage genes such as DCT, RAB32, TYRP1, 
TRPM1, MITF, and MLANA. The highest usage 
of the melanocyte program was observed in 
WT melanocytes, whereas it gradually declined 
in the early mutant genotypes and was still 
expressed—albeit at lower levels and in fewer 
cells—in all of the quadruple- and quintuple- 
mutant melanocytes except the CBT3 melano- 
cytes. The interferon/p53 program was first 
activated in C melanocytes, reached its apogee 
in CB melanocytes, and was turned off in CBT 
melanocytes, such that activation of telomer- 
ase through the T mutation led to a sharp de- 
crease in program usage. Both interferon and 
p53 have been associated with senescence 
(37-39), which we observed in CB cells in vitro. 
This pattern is consistent with cells under- 
going stress and telomeric crisis as they age, 
with telomerase activation reversing these 
stressors. The EMT program was associated 
with genes related to invasive potential 
(e.g., SERPINE2, TIMP3, FN1, VIM, PMEPA1, 
LGALS1) and was strongly activated in CBT 
melanocytes, notable for the reported link be- 
tween TERT and EMT (40, 41). The program 
was also active in those CB cells that were at 
the phenotypic transition, as well as some of 
the CBT3 and CBTP cells, particularly those 
adjacent to CBT cells. EMT has mostly been 
studied in epithelial cells, and it is unclear 
how it relates to cell motility and metastatic 
capacity in tumors with neural crest origins, 
such as melanomas (42, 43). The Myc/mTORC1 
program was activated in CBTA cells (and in 
some of the CBT3 cells) whereas the Myc/ 
mTORC1/Ox-Phos program was activated in 
PTEN mutant cells (CBTP, CBTP3, and CBTPA). 
Both cell cycle programs were used at a higher 
level in all the genotypes that included the T 
mutation and thus possessed replicative im- 
mortality (fig. S6), whereas among the cells 
without the T mutation, noncycling “G0” C 
and CB cells gradually moved away from the 
more distinct G0 WT cells (fig. S6D). Notably, 
many of the programs also matched those 
observed in scRNA-seq of human melanoma 
cell lines (44) (fig. S7). Overall, these results 
demonstrate that melanoma-associated muta- 
tion combinations activate and repress specific 
expression programs that are shared across 
genotypes and, in some cases, help explain the 
overall cellular phenotypes. 


Fig. 2. Consecutive mutations produce ordered progression through 
expression space and activate shared expression programs. (A) Experimental 
overview to profile gene expression in parallel from cells from nine engineered 
genotypes with hashed scRNA-seq. (B) Gradual progression of cell states with 
genotype. UMAP embedding of melanocyte scRNA-seq profiles (dots) colored and 
labeled (boxes) by genotype (legend). Arrows follow the editing tree (as in Fig. 1B). 
(C) Expression programs. UMAP embedding as in (B) colored by per-cell relative 
usage (color bar) of each of the seven expression programs identified by cNMF. 
(D) Programs reflect key processes and vary across genotypes. Top: Distribution 
of relative program usages (y axis) in single cells of each genotype (x axis, 
color legend). Middle: Aggregate (pseudobulk) expression [Z score of expression 
(log2 of transcripts per 10,000 reads, TP10K), color bar] and percent of expressing 
single cells (white circles) of the 15 top program-associated genes (rows) per 
genotype (columns). Bottom: Ranked lists of gene sets (MSigDB hallmark) (83) 
enriched in each program (Mann-Whitney U test, false discovery rate (FDR) < 5 x 10°; 
*FDR < 10° **FDR < 10⁻¹²). 


Mutation combinations confer distinct, 
disease-relevant tumor phenotypes in vivo 


We next related multimutant genotypes to their 
tumor phenotypes in vivo by injecting each 
immortalized melanoma model into the dermis 
of immunodeficient mice and assessing for 
disease-relevant features (Fig. 3A). CBT melano- 
cytes were malignant in vivo and formed slow- 
growing tumors in xenograft models. Between 
67 and 111 days, no primary tumor growth was 
detected [Fig. 3, B and C, (n = 4) and (n = 8), 
respectively, black curves; and fig. S8 (n = 8)]. 
However, upon tissue harvest, small nodules 
were visible at the injection sites. Histologic 
and immunophenotypic evaluation confirmed 
the presence of melanoma cells in these nodules 
(six of six tumors examined; figs. S9 to S12 and 
tables S5 to S7), with half of the nodules also 
displaying adjacent features of a congenital 
nevus (three of six tumors). Over a longer time 
course (≥150 days), a small tumor (<14 mm³) 
occasionally became apparent at the injection 
site before tissue harvest [7 of 12 injections; 
Fig. 3D, (n = 8), inset: slight uptick of black 
curve by day 151; and fig. S13, no growth by 
day 189 (n = 4)]. These small melanocytic 
tumors were malignant by dermatopathologic 
evaluation (four of four tumors examined; fig. 
S14 and table S8). Thus, in line with observations 
in patient melanomas (18), melanocytes with 
common melanoma mutations in the endogenous 
loci of CDKN2A, BRAF, and TERT displayed 
phenotypic characteristics of early melanomas. 


Fig. 3. Mutation combinations confer diverse, disease-relevant phenotypes 
in vivo. (A) Experimental approach to identify disease-relevant phenotypes 
caused by engineered mutations in vivo. (B to F) Primary tumor growth 
of xenografted mutant melanocytes in NSG mice compared with CBT or 
CBTP control parental cells (as shown) that received nontargeting Cas9 RNP: 
(B) CBT3, (C) CBTA, (D) CBTP, (E) CBTP3, and (F) CBTPA cells. Top panels: 
tumor size (cubic millimeters, y axis) over time (days, x axis) after two intradermal 
injections, one in each flank. n, number of tumors. Bottom panels: representative 
images of (shaved) mice harboring mutant cells as marked. A ruler with large, 
numbered marks in centimeters is shown for scale. (G and H) Loss of APC 
promotes frequent distant metastases. Average number of individual metastatic 
foci per section (symbols) of lung (G) or liver (H) tissue in a histologic slide 
(y axis, counted manually) obtained from a single mouse injected with a 
mutant cell line (genotype indicated by color) and collected after the indicated 
number of days (x axis). Each slide had an average of three lung sections 
and two liver sections, each from a different lobe. (I) Injected CBTPA melanocytes 
cause rapid weight loss in mice. Percent of initial mouse weight [y axis, 
determined after subtracting primary tumor weights (estimated at 1 g/cm³) 
from measured mouse weight] over time (x axis, days). n, number of 
mice. Data in (G and H) are from the four independent experiments in 
(C to F). #, two CBTA mice, one from each guide group, were sacrificed for 
histological inspection (C and I). ##, one CBTPA mouse was euthanized as a 
result of primary tumor ulceration (F and I). *P < 0.01; ns, not significant; 
two-tailed, two-sample Student's t test. 


Three alternative quadruple-mutation 
branches of the editing tree—CBT3, CBTA, 


and CBTP—each had distinct effects on mela- 
noma phenotypes. CBT3 cells did not produce 
visible tumors over a period of ~60 days, al- 
though by day 69 a few injection sites (3 of 16) 
began to show small tumors (<14 mm³; Fig. 3B: 
slight uptick of red curve by day 69). By con- 
trast, CBTA cells formed darkly pigmented, 
macular (flat) growths by day 10 that advanced 
to slow-growing, darkly pigmented tumors 
by day 29 (Fig. 3C). Finally, CBTP melanocytes 
formed amelanotic tumors in mice that grew 
faster than CBT3 tumors but slower than CBTA 
tumors (compare Fig. 3D with Fig. 3B). In all 
examined quadruple-mutant tumors, histologic 
and immunophenotypic features of the xeno- 
grafted melanoma models resembled those 
of patient melanomas (three of three, CBT3; 
three of three, CBTA; four of four, CBTP; figs. 
S15 to S17 and tables S6 to S8). Our findings 
suggest that in the setting of mutant CDKN2A, 
BRAF, and TERT, loss of APC causes more 
potent progression of human melanomas than 
does loss of either of the more commonly 
mutated genes, PTEN or TP53. 

A fifth engineered mutation led to fea- 
tures of aggressive melanocytic disease. Tu- 
mors formed by CBTP3 melanocytes showed 
a beyond-additive, increased growth rate com- 
pared with both CBTP and CBT3 melanocytes, 
highlighting synergy between the PTEN and 
TP53 mutations (Fig. 3E, compare with Fig. 3, 
B and D, at days 67 to 70, P = 1 x 10° by two- 
sided, modified Welch’s t test) (30). CBTP3 
tumors also showed evidence of emerging 
tumor heterogeneity with tumors that were 
mostly amelanotic (similar to CBTP tumors). 
However, these frequently had contiguous 
sectors of dark pigmentation of varying size 
(fig. S18, see below for analysis of expression 
and genetic heterogeneity of these tumors). 
Tumors formed by CBTPA melanocytes had 
the fastest growth rate of all the engineered 
melanoma models (Fig. 3F), with mice that 
had received the CBTPA melanocytes requir- 
ing euthanasia by day 36 as a result of primary 
tumor burden. Similar to CBTA tumors but in 
contrast to CBTP tumors, CBTPA tumors were 
uniformly darkly pigmented (Fig. 3F and fig. 
S18C). This is in line with phenotypes of deep 
penetrating nevi (DPN) and DPN-like mela- 
nomas, in which Wnt pathway mutations have 
been associated with increased pigmentation 
(34). Both quintuple-mutant genotypes re- 
sembled patient melanomas by histologic 
and immunophenotypic features (four of four, 
CBTP3; four of four, CBTPA; figs. S19 and S20 
and tables S9 and S10). 
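
For readers unfamiliar with the test named above, the sketch below computes the standard Welch statistic and its Welch-Satterthwaite degrees of freedom for two groups with unequal variances. It is the textbook form only; the modification the authors applied is described in their supplementary methods (30) and is not reproduced here. The function name and the tumor volumes in the usage example are hypothetical, and the p-value is omitted because the Python standard library has no t-distribution CDF.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom).

    a and b are sequences of measurements (at least two values each);
    sample variances are used, and the groups may differ in size.
    """
    va = variance(a) / len(a)  # squared standard error of mean(a)
    vb = variance(b) / len(b)  # squared standard error of mean(b)
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


if __name__ == "__main__":
    # Hypothetical tumor volumes (mm^3) for two genotypes at one time point.
    slow = [10.0, 14.0, 12.0, 9.0]
    fast = [220.0, 310.0, 180.0, 260.0]
    t_stat, dof = welch_t(fast, slow)
    print(f"t = {t_stat:.2f}, df = {dof:.2f}")
```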

Metastatic propensity was also associated 
with tumor genetics. CBTP tumors yielded a 
small number of lung metastases by day 151 
(Fig. 3G), whereas CBTA cells metastasized to 
both the lung and liver (two common sites of 
melanoma metastasis) by day 111 (Fig. 3, G and 
H), as well as to other organs (fig. S21). Tumors 
formed by CBTPA melanocytes readily meta- 
stasized to visceral organs, with numerous 
metastases visible in the lungs and liver by 
day 36 (Fig. 3, G and H, and fig. S22). Rapid- 
onset weight loss, another characteristic 
of aggressive disease, was also apparent by 
day 19 after xenograft injection (Fig. 3I). 
Together with our observations of metas- 
tasis in the CBTA model (Fig. 3, G and H, and 
fig. S21), our findings point to loss of APC as 
a major cause of metastatic disease in this 
genetic context. This is likely attributable to 
Wnt pathway activation, although its role in 
melanoma metastasis has been an open question 
(26, 45-47). The CBTPA melanocyte model 
also shows that as few as five mutant genes 
are sufficient to produce aggressive, metastatic 
human melanoma, at least in an immuno- 
deficient host. 

To test for the possibility that additional 
mutations had accrued during the process of 
engineering and cellular proliferation, we se- 
quenced the genome of a CBTPA tumor and 
compared it with the parental WT melanocyte 
genome to identify somatic events. We did not 
find mutations of apparent in vivo phenotypic 
consequence beyond those we had introduced. 
Notably, we did identify a clonal, two-fold 
tandem duplication of the melanocyte line- 
age transcription factor MITF (table S11 and 
fig. S23) (30)—a gene amplified in 5 to 10% of 
melanomas (8, 24, 48)—but it had no major 
observed phenotypic consequences (30). The 
spontaneous duplication of a gene frequently 
amplified in melanomas underscores the simi- 
larity of our cell models to melanomas arising 
in humans. We identified no further somatic 
alterations of known cancer association, with 
no additional chromosomal segment amplifi- 
cations or deletions (fig. S23), only 12 clonal, 
nonsilent somatic point mutations (not includ- 
ing engineered mutations; table S12 and figs. 
S24 to S28), and only one structural variant
(deletion of RIC8B; table S11). These findings 
reduce the possibility that spontaneous, un- 
planned mutations are responsible for the 
phenotypic differences observed between our 
engineered model genotypes; however, in the 
absence of deep sequencing of all model geno- 
types at the time of their phenotypic charac- 
terization, contributions from such mutations 
cannot fully be excluded. 

Taken together, our results establish causal 
relationships between disease characteristics 
and six different combinations of melanoma 
mutations in human melanocytes. These results 
also demonstrate that genome-edited melano- 
cytes recapitulate important aspects of tumor 
development in vivo. 


Genotype-driven intrinsic tumor cell 
expression programs in vivo 


To assess the in vivo cellular phenotypes caused 
by melanoma mutation combinations, we next 
performed scRNA-Seq on tumors from our 
xenografts (Fig. 4A) (30). Because each tumor 
consists of an intricate ecosystem of melano- 
cytic tumor cells (of human origin) along with 
stromal and immune cells within the tumor 
microenvironment (of mouse origin), we inves- 
tigated the effects of mutations on different cell 
types separately. We first computationally dis-
tinguished tumor cells as those cells whose se-
quencing reads predominately mapped to the
human genome (fig. S29) (30). For tumor cell 
analysis, we retained 26,964 high-quality tumor 
cells, with a median of 1609 (range: 31 to 4999) 
cells per sample across three replicate tumor 
samples of each of the CBTP, CBTA, CBTP3, 
and CBTPA tumors grown in mice for ~1 to 
2 months and two replicate tumors for CBTP 
tumors grown for ~6 months (fig. S30 and 
table S13) (30). 
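
The species assignment described above can be illustrated with a minimal sketch. It assumes that per-cell counts of reads mapped to the human and mouse genomes are already available. The function name, the example counts, and the 0.9 cutoff are hypothetical choices for illustration; the criteria actually used are those given in the supplementary methods (30).

```python
def assign_species(human_reads, mouse_reads, threshold=0.9):
    """Label a cell by the genome to which most of its reads map.

    threshold is a hypothetical cutoff, not a value taken from the study.
    """
    total = human_reads + mouse_reads
    if total == 0:
        return "unassigned"
    human_fraction = human_reads / total
    if human_fraction >= threshold:
        return "human"      # candidate tumor cell
    if human_fraction <= 1 - threshold:
        return "mouse"      # candidate microenvironment cell
    return "ambiguous"      # e.g., a possible doublet


# Toy (human, mouse) read counts for three hypothetical cells.
cells = {"cell_1": (9500, 120), "cell_2": (40, 7800), "cell_3": (3000, 2900)}
labels = {name: assign_species(h, m) for name, (h, m) in cells.items()}
print(labels)
```

Keeping an explicit "ambiguous" class, rather than forcing every cell into one species, is one way to keep mixed-species droplets out of both the tumor cell and the microenvironment analyses.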

The expression profiles of mutant melano- 
cytes isolated from in vivo tumors were grouped 
predominantly by genotype. In a genotype- 
agnostic, unsupervised 2D UMAP embedding 
of the profiles, melanocytes from CBTA, CBTPA, 
and 2-month-old CBTP tumors formed one 
cluster per tumor (Fig. 4B). Melanocytes from 
CBTP3 tumors formed two clusters per tumor, 
and melanocytes from the 6-month-old CBTP 
tumors formed either two or three clusters 
per tumor (Fig. 4B). In addition, hierarchical 
clustering of the UMAP cell clusters by the 
Pearson correlations of their mean expres- 
sion profiles showed that cell clusters from 
CBTA and CBTPA tumors were mostly grouped 
by genotype, whereas 6-month-old CBTP and 
CBTP3 tumors were partitioned into two clades, 
one of which was particularly distinct (fig. S31). 
To assess whether spontaneous genetic changes 
could be driving the emerging within-sample 
clusters, we inferred chromosomal copy number 
alterations (CNAs) from single-cell expression 
profiles, as previously demonstrated for hu- 
man tumors (49) (fig. S32). If CNA acquisi- 
tion was the initiating event for formation of 
supernumerary within-sample clusters, then 
each supernumerary cluster should demon- 
strate distinct and clonal CNAs. This was the 
case for CBTP3 tumor clusters (fig. S32), but 
not for the within-sample clusters of 6-month- 
old CBTP tumors, which did not show CNAs 
(fig. S32). [Note that most of the CBTP3-specific 
CNAs were not common patient-derived mela- 
noma CNAs, though they sometimes included 
genes often mutated in melanoma patients
(16, 24, 27).] Thus, although engineered geno- 
type was the main driver of expression differ- 
ences between models in vivo, subpopulations 
of expression states emerged either with time 
(6-month-old CBTP tumors) or more rapidly 
as a consequence of additional genotype changes 
(CBTP3 tumors). 
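
The grouping of cell clusters by the Pearson correlations of their mean expression profiles can be sketched as follows. This is a toy illustration only: the expression values are invented, and the choice of average linkage on the distance 1 - r is an assumption, because the linkage method is not stated in the text.

```python
from itertools import combinations
from math import sqrt


def mean_profile(cells):
    """Average expression per gene across the cells of one cluster."""
    n = len(cells)
    return [sum(values) / n for values in zip(*cells)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def average_linkage(profiles):
    """Agglomerative clustering on distance 1 - r; returns the merge order."""
    groups = {name: [name] for name in profiles}
    merges = []
    while len(groups) > 1:
        def dist(a, b):
            pairs = [(i, j) for i in groups[a] for j in groups[b]]
            total = sum(1 - pearson(profiles[i], profiles[j]) for i, j in pairs)
            return total / len(pairs)

        a, b = min(combinations(groups, 2), key=lambda pair: dist(*pair))
        merges.append((a, b))
        groups[a + "+" + b] = groups.pop(a) + groups.pop(b)
    return merges


# Invented per-cluster cells (rows) over four genes (columns).
profiles = {
    "CBTA": mean_profile([[5, 1, 0, 2], [7, 1, 0, 2]]),
    "CBTPA": mean_profile([[5, 2, 0, 3]]),
    "CBTP3_a": mean_profile([[0, 4, 6, 1]]),
    "CBTP3_b": mean_profile([[1, 5, 5, 0]]),
}
merges = average_linkage(profiles)
print(merges)
```

In this toy input the two pigmented-like profiles merge first and the two CBTP3-like profiles merge second, which mirrors the idea of clusters grouping mostly by genotype.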

We identified eight expression programs 
active in mutant melanocytes in vivo and 
used by cells across multiple genotypes (Fig. 4C 
and fig. S33) (30, 36), which we annotated as 
“ribosomal,” “Ox-Phos,” “interferon/TGFβ,”
“EMT,” “β-catenin/MITF,” “interferon/TNFα/
hypoxia,” “protein secretion,” and “cell cycle” on
the basis of the top genes and gene sets asso-
ciated with each (Fig. 4D and tables S14 and S15)
(30). The programs Ox-Phos and β-catenin/

per genotype (columns). Bottom: Ranked lists
of gene sets (MSigDB hallmark) (83) enriched in each program (Mann-Whitney U test, FDR< 10°; *FDR < 107°: **FDR < 10°°°). (E) Correspondence of in vivo 
and in vitro programs. Significance of overlap [-log10(P value), Fisher's exact test, colorbar] of top 50 associated genes between in vivo (rows) and in vitro
(columns, as in Fig. 2) programs. Only overlaps with P value < 10°° are shown to account for multiple hypothesis testing. (F) Correspondence of in vivo
programs and programs in patient melanomas (16). Significance of overlap [-log10(P value), Fisher's exact test, colorbar] of top 50 associated genes between
in vivo (rows) and patient (column) programs. Only overlaps with P value < 10° are shown to account for multiple hypothesis testing. Associations of 
expression programs with either p53 or Wnt pathway gene mutations are noted. (G) Similar usage of melanoma (16) and in vivo model programs across in vivo 
melanoma model single-cell profiles. UMAP embedding [as in (B)], colored by per-cell relative usage of patient melanoma expression programs (right) or 
sums of relative usages of in vivo melanocyte expression programs (left). 

MITF [also associated with TRPM1, CCND1,
TYRP1, MLANA, and CDH1 (E-cadherin)] were
mostly used by tumors with Wnt pathway ac-
tivation (APC mutation). The interferon/TGFβ,
the interferon/TNFα/hypoxia, and the EMT
programs were associated with tumors with
p53 pathway inactivation (TP53 mutation).
Furthermore, the interferon/TGFβ program
predominantly characterized cells from a sin- 
gle within-sample cluster in each of the CBTP3 
and 6-month-old CBTP tumors (described 
earlier; with only one CBTP3 cluster having 
the aforementioned clonal CNAs; Fig. 4, B 
and C; program 3), suggesting that these clus- 
ters reflect a shared inflammatory expression 
state, despite their tumor and genotype differ- 
ences. Most in vivo programs were similar to 
one or a combination of programs observed 
during in vitro culture, as judged by the over- 
lap of top associated genes (Figs. 2, C and D, 
and 4E), consistent with a largely cell-intrinsic 
origin. However, incomplete overlaps and dif- 
ferences between program usages may also re- 
flect interactions with the microenvironment. 


Genetically linked tumor expression programs 
are shared with patient melanoma tumors 
with matching genetic associations 


Expression programs identified in mutant 
melanocytes in vivo matched known programs 
seen in patient melanomas with correspond- 
ing genetic associations. Previous studies 
identified three cancer cell gene expression 
programs in bulk mRNA expression profiles 
of patient melanomas (16): (i) an MITF-low
program (24, 50) associated with low expres- 
sion of MITF; (ii) an OxPhos program as- 
sociated with genes involved in oxidative 
phosphorylation and differentiated mela- 
nocytes and with low levels of hypoxia- 
associated genes, and (iii) a less well-defined 
Common program shared by MITF-low and 
OxPhos tumors and associated with increased 
expression of MITF and interferon signaling 
genes. The Ox-Phos and β-catenin/MITF in vivo
programs (highly used by the darkly pigmented 
CBTA and CBTPA tumors) showed substantial 
overlap of top genes with the OxPhos program 
from patient melanomas, which was itself en-
riched in patient tumors with mutations in
Wnt pathway genes and high levels of pig-
mentation (Fig. 4F, Fisher’s exact test, top 50
associated genes, P = 1 x 10° and 7 x 107”,
respectively) (16), thus matching the molecular
program, its genetic association, and tumor phe-
notype. Similarly, the EMT, interferon/TGFβ,
and interferon/TNFα/hypoxia in vivo programs
(highly used by CBTP3 and 6-month-old CBTP
tumors) shared top genes with the patient
melanoma MITF-low program (P = 4 x 10°,
7x 1077, and 2 x 107), which was itself en-
riched in patient tumors with TP53 mutations
(Fig. 4F) (16). Finally, the ribosomal in vivo
program (mostly used by 2-month-old CBTP
tumors and not observed in vitro) overlapped
with the Common program from patient mela- 
nomas (P = 7 x 10~), suggesting that it captures 
expression features present in patient samples 
but also shares top genes with interferon/TGFβ
and Ox-Phos (P = 1 x 10° and 2 x 10°*), two 
in vivo programs that overlapped both MITF- 
low and OxPhos patient melanoma programs, 
respectively (Fig. 4F). 
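
The overlap statistics reported above compare the top 50 genes of two programs with Fisher's exact test. A minimal sketch of such a test is given below. It computes the one-sided (enrichment) P value as a hypergeometric upper tail; the sidedness, the size of the gene universe, and the toy gene names are assumptions made for illustration and are not taken from the study.

```python
from math import comb


def overlap_p_value(genes_a, genes_b, universe_size):
    """One-sided Fisher's exact test P value for the overlap of two gene sets.

    Computed as the probability, under random sampling from a gene universe
    of the given size, of an overlap at least as large as the one observed.
    """
    a, b = set(genes_a), set(genes_b)
    observed = len(a & b)
    total = comb(universe_size, len(b))
    tail = sum(
        comb(len(a), i) * comb(universe_size - len(a), len(b) - i)
        for i in range(observed, min(len(a), len(b)) + 1)
    )
    return tail / total


# Hypothetical top-gene lists (three genes each) in a universe of 10 genes.
program_in_vivo = ["g1", "g2", "g3"]
program_patient = ["g1", "g2", "g4"]
p = overlap_p_value(program_in_vivo, program_patient, universe_size=10)
print(f"overlap P value = {p:.4f}")
```

With real data the two lists would each hold 50 genes and the universe would be the set of genes eligible to appear in either list, which is why the universe size has to be chosen explicitly.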

Moreover, in engineered melanocyte cells 
from in vivo tumors, the individual activities 
of the patient melanoma programs OxPhos 
and MITF-low (derived from bulk tumors) 
closely matched those of the combined in vivo 
single-cell programs: (i) Ox-Phos, β-catenin/
MITF and protein secretion, and (ii) EMT,
interferon/TGFβ, and interferon/TNFα/hypoxia,
respectively (Fig. 4G). Furthermore, usage of 
the OxPhos and MITF-low patient melanoma 
programs in our models showed intratumoral 
heterogeneity (Fig. 4G, right column), as has 
been observed in scRNA-seq of patient mela- 
nomas (with programs comparable to those 
identified here, fig. S34) and in cell lines (49). 
Overall, our engineered melanocyte models 
recapitulate the expression states and genetic 
associations in patient melanomas and sug- 
gest that the expression programs described 
to date in patient melanomas are a composite 
of coincident, biologically distinct programs 
that are not merely associated with specific 
gene mutations but rather are caused by them, 
albeit with intratumoral variation. 


Tumor genotypes shape the composition and 
expression state of infiltrating stromal and 
immune cells 


Next, we estimated the effect of malignant cell 
genotype on the tumor microenvironment by 
analyzing the scRNA-seq profiles of 13,332 
mouse cells (median cells per sample: 576, 
range: 88 to 4043) across all tumors (Fig. 5A) 
(30). The cells spanned immune cell types 
(neutrophils, dendritic cells, plasmacytoid 
dendritic cells, and M1 and M2 macrophages), 
as well as endothelial and epithelial cells, 
pericytes, and cancer-associated fibroblasts 
(Fig. 5B and fig. S35), which we annotated by
marker gene expression (figs. S36 to S39) (30). 
[T, B, and natural killer (NK) cells are absent, 
as expected in these immunodeficient
NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ (NSG) mice.] As
observed in patient tumors (49, 51), profiles 
from cells of the microenvironment are 
primarily grouped by cell type (Fig. 5B). We 
observed tumor age-related changes in cell 
composition among cancer-associated fibro- 
blasts, which shift from more balanced ratios of 
contractile and immunomodulatory expres- 
sion phenotype populations in 2-month-old 
tumors to 97 to 98% of fibroblasts having the 
contractile phenotype in 6-month-old CBTP 
tumors (Fig. 5C), in line with previous obser- 
vations (52). 
Melanocyte genotype altered the cellular
composition of the tumor microenvironment 
(Fig. 5C)—most notably neutrophils, which are 
known to be associated with poor early stage 
melanoma prognosis (53). Neutrophils con- 
stituted ~40% of the tumor microenvironment 
cells in CBTA and CBTP3 tumors on average 
but were nearly absent from CBTP tumors of 
any age and constituted only ~2% of the mi- 
croenvironment cells in CBTPA tumors (which 
are also the tumors grown in mice for the 
shortest time because of their fast growth rate) 
[Fig. 5, B and C, and figs. S30, S35B, and S40; 
family-wise sign error rate (Bayesian proxy for 
family-wise error rate controlled P value) < 0.01 
for comparisons of CBTA or CBTP3 versus 
CBTP (2 or 6 months), hierarchical Bayesian 
multinomial logistic mixed effects model (30)]. 
The differences in neutrophil infiltration be- 
tween tumors sharing many of the same 
mutant genes underscore the importance of 
mutation combinations in shaping the tu- 
mor microenvironment. We hypothesize that 
neutrophil infiltration in individual tumors 
may be due to differences in tumor-immune 
cell communication. Supporting this hypoth- 
esis, the melanocytes from the two CBTA 
tumors and one CBTP3 tumor with highest 
neutrophil infiltration [CBTA replicates 1 and 
2 (rep. 1 and 2); CBTP3 rep. 1; fig. S35B] ex- 
pressed the chemoattractant CCL2, known 
to attract and activate neutrophils (fig. S41A) 
(54). Notably, neutrophils in CBTA and CBTP3 
tumors also displayed shifts in distribution 
across different cell states, partly tracking 
with genotype (Fig. 5, D to F, and fig. S35). 
Neutrophils from the two CBTA tumors with 
the highest neutrophilic infiltrate were asso- 
ciated with an expression program previously 
observed in tumor-infiltrating and tumor- 
promoting neutrophils (N5), whereas CBTP3- 
infiltrating neutrophils in the most enriched 
replicate (rep. 1) expressed programs that
resembled the expression state of healthy-
tissue neutrophils (N1 and N3) (55) (Fig. 5, D
to F, and figs. S35 and S41B). 

Melanocyte genotype also influenced the 
cellular state of other immune cells in the 
tumor microenvironment. Examining each 
cell type for genotype-associated expression 
differences, we observed that macrophages 
from different tumor genotypes grouped in 
genotype-related patterns (Fig. 5G). Although 
M2 macrophages—most prominent in CBTA 
tumors—did not show clear tumor genotype- 
specific expression changes, M1 macrophages 
preferentially activated three different expres- 
sion programs depending on tumor geno- 
type (Fig. 5, H and I, fig. S41C, and tables 
S16 and S17): (i) a “complement/ribosomal” 
program, enriched in ribosomal protein genes 
and a subset of complement component genes 
(C1qa, C1qb, C1qc) that were reported to
increase in response to apoptotic cells and 

Fig. 5. Tumor genotype shapes tumor microenvironment composition.


(A) Experimental approach to profile gene expression from mouse cells in tumor 
xenografts with scRNA-seq. (B and C) Remodeling of the tumor microenvironment 
cellular composition by cancer cell genotype and duration in mice. (B) UMAP 
embedding of tumor microenvironment scRNA-seq profiles (dots) colored by 
genotype and replicate (legend) and labeled by cell type. #, CBTP rep. 3 is a mixture 
of four tumors from two mice, whereas all other replicates are from a single tumor. 
(C) Mean (bar) and individual (dots) percent (y axis) of tumor microenvironment 
cells of each type (x axis) in each genotype (color). (D to F) Diversity of neutrophil 
expression programs in tumors of different genotypes. (D) UMAP embedding of 
neutrophil single-cell profiles (dots) from specific tumor genotypes (colored) and all 
other genotypes (gray). (E) UMAP embedding of neutrophil profiles (as in D) 
highlighting only the neutrophils in CBTA and CBTP3 tumors colored by per-cell
score of neutrophil expression signatures, N5 (associated with tumor growth), 
and N1N3 (more similar to circulating and healthy-tissue neutrophils); signatures 
previously described (55). (F) Distribution of per-cell neutrophil expression 
program score (y axis) in neutrophils from CBTA (green) and CBTP3 (blue) tumors. 
P value < 0.001 (N5), P value < 0.001 (N1N3), Kruskal-Wallis rank-sum test,
degrees of freedom = 5. (G to I) Effect of tumor genotype on macrophage expression
programs. (G and H) UMAP embedding of M1 and M2 macrophage single-cell profiles 
(dots), colored by specific tumor genotypes (G) or by per-cell relative usage of 
macrophage gene expression cNMF programs (H). (I) Fraction of cells (x axis) with 
the highest score in each of four cNMF programs (colors) among M1 cells (defined
as cells with an M2-related program score <0.45) in each replicate (y axis). 

during macrophage polarization toward the
M2 phenotype (56, 57); (ii) an “S100/Inter-
feron” program, enriched in genes related
to TNFα response through NF-κB (e.g., Cebpb
and AZ), interferon response (e.g., Ifitm1
and Ifitm3), and S100 calcium-binding cyto-
solic proteins (e.g., S100a4/6/10/11); and (iii) a
“metabolism/Lgals” program enriched in genes 
involved in cellular metabolism (e.g., Aldoa 
and Ldha) and cell-cell interactions such as 
Lgals1 or Lgals3, also involved in immuno- 
regulatory functions (58). CBTP tumor M1 
macrophages primarily expressed the comple-
ment/ribosomal program (Fig. 5I), whereas
M1 macrophages from the CBTPA tumors 
mostly expressed the metabolism/Lgals program. 
Lastly, the CBTP3 tumors had macrophage dis- 
tributions matching either the CBTP (CBTP3 
rep. 1 and 2) or CBTPA (CBTP3 rep. 3) pat-
terns (Fig. 5I), perhaps because of genetic
heterogeneity (CNAs; fig. S32) (30) seen be- 
tween CBTP3 tumors. Finally, M1 macro- 
phages from the CBTA tumors predominantly 
expressed either the S100/interferon program 
or the metabolism/Lgals program. M2-scoring 
cells lie in proximity to M1-like cells that score 
highly for either the complement/ribosomal 
or metabolism/Lgals programs, suggesting 
possible alternative paths from M1 to M2 in 
tumors from different genotypes, depend- 
ing on which of these two programs are used. 
Overall, these results show that tumor mutation 
combinations shape not only the cellular com- 
position of the tumor microenvironment but 
also the cellular state of the individual cell types 
within the microenvironment. 
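
The per-replicate summary of macrophage programs (the fraction of M1 cells whose highest-scoring program is each program, with M1 cells defined by an M2-related program score below 0.45) can be sketched as follows. The program names follow the text; the data layout, the function name, and the example scores are hypothetical.

```python
from collections import Counter


def m1_program_fractions(cells, m2_key="M2", m2_cutoff=0.45):
    """Fraction of M1 cells whose top-scoring program is each program.

    cells: list of dicts mapping program name -> relative usage score.
    A cell is treated as M1 when its M2-related score is below m2_cutoff.
    """
    m1_cells = [c for c in cells if c[m2_key] < m2_cutoff]
    if not m1_cells:
        return {}
    top_programs = Counter(max(c, key=c.get) for c in m1_cells)
    return {name: n / len(m1_cells) for name, n in top_programs.items()}


# Invented relative usages for four macrophages from one replicate.
replicate = [
    {"complement/ribosomal": 0.6, "S100/interferon": 0.2, "metabolism/Lgals": 0.1, "M2": 0.1},
    {"complement/ribosomal": 0.5, "S100/interferon": 0.1, "metabolism/Lgals": 0.3, "M2": 0.1},
    {"complement/ribosomal": 0.1, "S100/interferon": 0.1, "metabolism/Lgals": 0.6, "M2": 0.2},
    {"complement/ribosomal": 0.1, "S100/interferon": 0.1, "metabolism/Lgals": 0.2, "M2": 0.6},
]
fractions = m1_program_fractions(replicate)
print(fractions)
```

The last cell is excluded as M2-like, so the fractions are computed over the remaining three M1 cells.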


Tumor histological features are genotype 
specific and coincide with genotype-associated 
expression programs in patient melanomas 


We examined the association between the geno- 
types of mutant melanocytes and microscopic 
tumor appearance by assessing whether geno- 
type can be predicted from histopathological 
images alone, indicating a relationship between 
the two. To this end, we trained a convolutional 
neural network model (59) on hematoxylin 
and eosin (H&E)-stained tumor sections of our 
mutant melanocytes grown as in vivo mouse 
xenografts, supervised by genotype (Fig. 6A). 
We then applied the model to predict the 
probability of each genotype on a per-tile 
(2048 pixels x 2048 pixels) basis, and com- 
bined the predictions (summing per-tile prob- 
ability vectors in a tumor section and selecting 
the genotype with maximum probability) to 
call an overall genotype (CBT, CBT3, CBTA, 
CBTP, CBTP3, or CBTPA), if there was sufficient 
prediction certainty (Fig. 6B; entropy of prob- 
ability vector <0.2) (30). We trained on 56 whole- 
microscope slide images (37%, corresponding 
to 5533 tiles) and tested on 94 (63%, 16,118 
tiles), ensuring that no mouse contributed 
images to both the training and test sets. 
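
The aggregation of per-tile predictions into a section-level call can be sketched as follows. The summing of per-tile probability vectors, the maximum-probability call, and the entropy cutoff of 0.2 come from the text. Renormalizing the summed vector and dividing the Shannon entropy by log(6) so that it lies between 0 and 1 are assumptions, as are the function name and the example tiles; the exact definitions are in the supplementary methods (30).

```python
from math import log

GENOTYPES = ["CBT", "CBT3", "CBTA", "CBTP", "CBTP3", "CBTPA"]


def call_section_genotype(tile_probs, max_entropy=0.2):
    """Aggregate per-tile probability vectors into one section-level call.

    Returns (genotype or None, section probabilities, entropy). None means
    the section is left unclassified because the prediction is too uncertain.
    """
    summed = [sum(column) for column in zip(*tile_probs)]
    total = sum(summed)
    probs = [s / total for s in summed]
    # Assumption: Shannon entropy normalized by log(number of genotypes).
    entropy = -sum(p * log(p) for p in probs if p > 0) / log(len(probs))
    if entropy >= max_entropy:
        return None, probs, entropy
    best = max(range(len(probs)), key=probs.__getitem__)
    return GENOTYPES[best], probs, entropy


# Three hypothetical tiles that all point strongly to CBTP.
confident_tiles = [[0.01, 0.0, 0.0, 0.98, 0.01, 0.0]] * 3
# Two hypothetical tiles that disagree between CBTP and CBTP3.
mixed_tiles = [[0, 0, 0, 1.0, 0, 0], [0, 0, 0, 0, 1.0, 0]]

print(call_section_genotype(confident_tiles)[0])
print(call_section_genotype(mixed_tiles)[0])
```

Withholding a call when the entropy is high is what allows a classifier of this kind to report a genotype for only a subset of sections.
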
The model classified 76% of sections and
had high accuracy [Fig. 6C, area under the 
curve (AUC) range 0.89 to 1.00, compared 
with an AUC of 0.50 resulting from the null 
model, the predictions of which are random] 
(30), with perfect assignment for CBT, CBT3, 
CBTA, and CBTPA tumor sections (Fig. 6D), 
possibly reflecting the within-genotype histo- 
logical homogeneity of these models. The most 
common misclassification was between CBTP 
and CBTP3 tumor sections (28% of CBTP sec- 
tions were classified as CBTP3, and 3% of 
CBTP3 sections were classified as CBTP), 
suggesting overlap in histopathological fea- 
tures between these tumor genotypes (Fig. 6D). 
This may be consistent with the observed ex- 
pression similarity between these malignant 
cells in vitro (Fig. 2B) and in vivo (fig. S31). 
Together, these findings show that mutant 
melanocyte genotypes give rise to distinguish- 
able tumor histologies. 

We then asked whether the distinguishable 
tumor histologies of our melanoma models 
were reflected, to any extent, in patient mela- 
nomas. To this end, we tested whether the 
neural network model trained on our geneti- 
cally distinct xenograft tumors (Fig. 6A) showed 
any predictive signal on patient melanoma 
histology. We first used the model to classify 
histological slides of patient melanomas by 
our six genome-engineered genotype labels 
(CBT, CBT3, CBTA, CBTP, CBTP3, or CBTPA) 
and then grouped all genotype labels contain- 
ing a given mutation to produce a proxy score 
(sum of probabilities of genotypes containing 
the mutation) for loss-of-function mutation 
status of APC, TP53, or PTEN in patient mela- 
nomas (24, 30). Inference of APC loss-of- 
function status resulted in an AUC of 0.58 
(APC or CTNNB1 mutations, grouped together
to increase the number of tumors). However, 
the 95% confidence interval (CI) (30) included 
random predictions (0.49 to 0.66; Fig. 6E), and 
consequently the result was not statistically 
significant. Inference of TP53 or PTEN muta-
tions approached random prediction [AUC:
0.52 and 0.53, respectively, and 95% CI 0.44 to 0.59
and 0.44 to 0.61, respectively; Fig. 6E, muta-
tion annotations as in (16)]. Despite lacking
statistical significance, the AUC for predict-
ing mutations in the Wnt pathway was within
the range of reported results from models
trained and tested on patient melanoma his-
topathology (APC AUC: 0.44 to 0.66, CTNNB1
AUC: 0.52 to 0.64, TP53 AUC: 0.59 to 0.62,
PTEN AUC: 0.44 to 0.66) (60, 61).
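
The proxy score described above (the sum of predicted probabilities over all engineered genotypes that contain a given mutation) can be sketched as follows. The letter-to-gene mapping is inferred from the genotype names used in the text (P for PTEN, 3 for TP53, A for APC); the function name and the example probabilities are hypothetical.

```python
# Letter in the genotype label that marks each loss-of-function mutation
# (mapping inferred from the genotype names, not stated as a table in the text).
MUTATION_LETTER = {"PTEN": "P", "TP53": "3", "APC": "A"}


def mutation_proxy_score(genotype_probs, gene):
    """Sum the predicted probabilities of all genotypes carrying the mutation."""
    letter = MUTATION_LETTER[gene]
    return sum(p for genotype, p in genotype_probs.items() if letter in genotype)


# Hypothetical classifier output for one patient slide.
slide_probs = {
    "CBT": 0.05, "CBT3": 0.10, "CBTA": 0.20,
    "CBTP": 0.25, "CBTP3": 0.15, "CBTPA": 0.25,
}
for gene in ("APC", "TP53", "PTEN"):
    print(gene, round(mutation_proxy_score(slide_probs, gene), 2))
```

Because the six labels overlap in the mutations they contain, the three proxy scores for one slide do not need to sum to 1.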

Because not all patient melanomas with 
Wnt pathway or TP53 mutations are readily
identifiable as such, we next used expression
programs previously associated with Wnt path-
way or TP53 mutations as biomarkers for either
the mutations themselves or their functional 
effects (16). The model showed a statistical- 
ly significant ability to predict expression 
programs associated with either the Wnt path-
way (OxPhos program, AUC: 0.74, 95% CI 0.67
to 0.81) or TP53 mutations (MITF-low and 
Common programs, AUC: 0.63, 95% CI 0.56 to 
0.70) (Fig. 6E). We corroborated these findings 
by verifying that our model did not exhibit 
predictive power on “wrong” or genotype- 
mismatched labels (e.g., by using APC loss-of- 
function predictions to predict which tumors 
express expression programs associated with 
TP53 mutations) (fig. S42A). 
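
The logic behind these AUC comparisons (a prediction counts as significant only if its 95% confidence interval excludes the chance value of 0.50) can be sketched as follows. The AUC is computed here as the probability that a positive sample outscores a negative one. The percentile bootstrap used for the interval is an assumption made for illustration, since the interval method is described only in the supplementary methods (30); the scores and labels are invented.

```python
import random


def auc(scores, labels):
    """Probability that a random positive outscores a random negative.

    Ties between a positive and a negative count as 0.5.
    """
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_ci(scores, labels, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the AUC (an assumption)."""
    rng = random.Random(seed)
    n = len(scores)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if all(ys) or not any(ys):
            continue  # resample lacks one class, so the AUC is undefined
        stats.append(auc([scores[i] for i in idx], ys))
    stats.sort()
    low = stats[int(alpha / 2 * n_boot)]
    high = stats[int((1 - alpha / 2) * n_boot) - 1]
    return low, high


def excludes_chance(ci, chance=0.5):
    """True when the interval lies entirely above or below chance."""
    low, high = ci
    return low > chance or high < chance


# Invented proxy scores and true labels for eight tumors.
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2]
labels = [1, 1, 0, 1, 0, 1, 0, 0]
ci = bootstrap_ci(scores, labels, n_boot=500)
print(auc(scores, labels), ci, excludes_chance(ci))
```

Applied to the intervals reported in the text, an interval of 0.49 to 0.66 includes 0.50 and is therefore not significant, whereas 0.67 to 0.81 and 0.56 to 0.70 both exclude it.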

The fact that our model—trained on histo- 
pathology of genome-engineered melanocytes 
grown in mice—can predict genotype-associated 
expression programs of patient melanomas 
from histopathology alone, even to a limited 
extent, is notable given the genetic complexity 
and heterogeneity of human melanomas. Our 
results suggest that genotype-associated ex- 
pression states are reflected in the histopath- 
ological features of human melanomas, and 
that elements of these features are shared be- 
tween our engineered melanocyte models and 
melanomas arising in patients. 


Discussion 


We have shown that the same fitness advan- 
tage by which cancer mutations drive clonal 
expansions in human tumorigenesis can be 
harnessed to generate multimutant models of 
cancer from primary human cells in a stepwise 
manner. By avoiding single-cell cloning, this 
model building strategy is applicable to can- 
cers arising from differentiated cells, which 
may not have sufficient replication potential 
to grow to large populations from a single cell. 
Because no selection markers are introduced, 
there is no exogenous DNA that could alter 
gene regulation or function. As a result, the 
genome-edited cell models are amenable to 
selection-marker-based experiments, including 
comparative molecular studies, genome-wide 
genetic screens (62-65), and pooled genetic 
perturbations, including those with high con- 
tent readouts (66, 67). Applying such approaches 
in a human context with matched genetic con- 
trols may be of particular use for discovery of 
therapeutic targets that could escape identifi- 
cation in nonhuman models as a result of inter- 
species differences, or in human cell lines or 
patient-derived xenografts because of extensive 
intersample genetic differences. The pheno- 
types we report are a product of one particu- 
lar order of mutations; however, the editing 
strategy lends itself to modeling alternative 
sequential ordering of mutation combinations 
in future studies. Overall, the stepwise, multi- 
mutant, genetically precise nature of these 
human cell models enables study of geno- 
type to phenotype relationships in a context 
that approximates several defining features 
of human cancer pathogenesis. 

The importance of studying combinations 
of mutations when linking cancer genotypes 
to phenotypes is highlighted by our observation of how the effect of a
given single mutation depends on genetic context. For example, CBTPA and
CBTP3 melanocytes in vitro were more similar in expression to CBTP
melanocytes than to CBTA or CBT3 melanocytes, suggesting that PTEN loss
modulates the effect of APC and TP53 loss on gene expression. In another
example, a TP53 mutation produced different effects depending on which
other mutations were present: CBT3 melanocytes in vivo did not form
sizable primary tumors, but CBTP3 tumors grew faster than CBTP tumors,
reflecting an interaction between PTEN and TP53 mutations (68) (although
no significant association between these two mutations has been noted in
patient melanomas) (16, 24, 69).

Fig. 6. Tumor genotype leads to distinct histological features that also
associate with genotype-linked expression states in patient melanomas.

(A) Computational approach to classify histological slides into engineered
genotypes. (B) Test set classification examples. Classification of individual tiles
(colored squares overlying tissue images), the aggregated classification for the entire
section (“prediction”), and the true genotype (“genotype”) for three examples.
(C and D) Successful prediction of genotype from histology in held-out mutant
melanocyte in vivo tumor section images. (C) Receiver operating characteristic
(ROC) curves of the prediction false positive rate (x axis) and true positive rate
(y axis) at each probability threshold for each genotype (color). The area under the
curve (AUC) is indicated for each genotype in the legend. (D) Percentage (color bar)
of samples from a given genotype (y axis) that received each genotype classification
(x axis). The percentage and number of such predictions are displayed within
each cell. (E) Inferring genotype and genotype-associated expression states in
patient melanomas (from TCGA) on the basis of images of H&E stained tumor
sections. ROC curves were obtained through: (left) predicting APC loss-of-function
genotype (a Wnt pathway gene) and setting true positive labels to be either Wnt
pathway mutants or a Wnt-associated expression program; (middle) predicting
TP53 loss-of-function genotype and setting true positive labels to be TP53 mutants
or TP53-associated expression programs; or (right) predicting PTEN loss-of-
function genotype and setting true positive labels to be PTEN mutants (no
PTEN-associated expression program available).

These nonadditive effects also extend to the 
tumor microenvironment. For example, the 
abundance of neutrophils in CBTA and CBTP3 
tumors and their absence or near absence in 
CBTP and CBTPA tumors was not attributable 
to a difference in a single mutation, as the 
two tumor groups shared many mutations. 
Future studies may explore the effect on ther-
apeutic response of the complex genetic inter-
actions apparent in these melanoma models in
cell-intrinsic (tumor cell) and -extrinsic (micro-
environment) ways, especially because resis-
tance to targeted therapies often cannot be
explained by single-gene mutations (70, 71).
Because of genetic epistasis, understanding 
the mutational landscape of human cancer is 
a combinatorial problem and its dissection 
requires modeling strategies that can scale to 
multiple mutations, such as the one presented 
in this study, along with appropriate analytics. 

Several challenges in melanoma research 
can be approached by using our genetically 
precise human models. First, our models can 
help shed light on the molecular basis of me-
tastasis (71). No mutation has been conclu-
sively linked to metastasis in melanomas
(26, 27) and the role of the Wnt pathway is 
debated (26, 45-47). Our human models es- 
tablish a causal link between an activating 
mutation in the Wnt pathway (through ge- 
netic inactivation of APC) and metastasis, 
markedly bolstering complementary lines of 
evidence for the role of Wnt in melanoma 
metastasis, most notably two mouse models 
(43, 72) and genetic evidence from a single 
patient (4). Our observation that APC muta-
tions undergo positive selection in culture
suggests that mutations driving metastasis
may confer a fitness advantage even within 
the primary tumor, which would explain why 
metastasis-specific mutations have not been 
seen in melanomas (26, 27). Moreover, our 
metastatic melanoma models reproducibly 
develop spontaneous metastases with short 
latency, addressing a recognized challenge 
in animal models in which rapid growth of 
primary tumors may not allow sufficient time 
for metastases to develop (71). This growth
pattern makes CBTA and CBTPA cells trac- 
table systems for studying key intrinsic and 
extrinsic factors in metastasis, in addition to 
allowing direct comparison of primary tumor 
and paired metastases. Although efforts to 
drug the Wnt pathway in cancer have yet to 
achieve marked clinical success, our well- 
characterized and genetically precise human 
metastatic melanoma models may support 
efforts to reveal melanoma-specific Wnt vul- 
nerabilities amenable to therapeutic interven- 
tion against metastasis, the most lethal aspect 
of malignancy (73). 

Another major challenge in melanoma re- 
search is understanding response and resis- 
tance to targeted therapies (71, 74)—e.g., BRAF 
and MEK inhibitors (75)—in which known 
genetic mechanisms explain ~60% of resis- 
tance (70). Although tumor cell gene expression 
programs are also thought to drive resistance 
(76), models of these programs are needed 
(71, 74). Our human melanoma models reca- 
pitulate the MITF-low expression signature 


Hodis et al., Science 376, eabi8175 (2022) 


associated with drug resistance (77) and dem- 
onstrate varying degrees of its presence across 
different genetic backgrounds (all sharing the 
BRAF V600E activating mutation); our models 
also link this expression signature to TP53 
inactivation. Our models may therefore be 
useful in examining the effect of combinations 
of mutations and their expression states on 
drug response and resistance. Additional- 
ly, precise isogenic models allow for well- 
controlled chemical and genomic knockout 
screens that may reveal approaches for dis- 
favoring the resistance-associated gene expres- 
sion program. 

In the context of understanding the im- 
mune response and the role of age or envi- 
ronmental exposure (77), although our models 
do not capture the adaptive immune system, 
we have linked precise melanoma mutations 
to changes in innate immune cells within a 
given tumor and provide a genetically con- 
trolled system for further study. Future studies 
may explore the integration of our models into 
systems that recapitulate human tissue inter- 
actions, such as 3D skin (78) or skin organo- 
ids (79), the implantation of our cells into 
mice of different ages, or the cells’ exposure 
to ultraviolet radiation. 

Lastly, the approach we present opens the 
door for creation of a broader set of human 
cellular cancer models for melanoma research. 
For example, the editing tree can be expanded 
to characterize the molecular and phenotypic 
consequences of other mutations that are 
commonly identified in patients but are still 
poorly understood, such as those in ARID2 
in the SWI/SNF pathway (8, 23). Furthermore, 
melanocytes from different bodily locations 
and developmental stages (known to affect 
melanocyte expression states) (80) can be used 
as alternative starting points, as can melano- 
cytes from donors of both sexes, varying ge- 
netic backgrounds, or different skin tones. We 
opted to engineer neonatal, foreskin-derived 
melanocytes as they are the most commonly 
used experimental human melanocytes, but in 
the future these can be compared with models 
generated from different types of melanocytes. 
Finally, the same editing approach can be 
applied to generate human models of uveal, 
acral, and mucosal melanomas, diseases for 
which precise cellular models are still lacking 
and targeted therapies are still not available 
(71). Though precise engineering of chromo- 
somal amplifications and deletions that typify 
the latter two subtypes may be more challeng- 
ing (87), an editing approach such as the one 
described here enables efficient testing of can- 
didate driver genes both individually and in 
combinations (77). 

Stepwise genome editing of human primary 
differentiated cells can convert genetic maps 
of cancer into genotype-phenotype understand- 
ing, as we demonstrated through use of a series 
of progressive human models of melanoma
development. The isogenic cellular models and 
their associated in vitro and in vivo single-cell 
expression profiles and histopathology images 
are a resource spanning multiple melanoma 
genotypes, including early melanoma pre- 
cursors that are notoriously difficult to obtain 
from patient-derived sources (26). Genome- 
edited human models advance knowledge of 
the genetic basis of human malignancy by 
ascribing causation of malignant phenotypes 
to defined sets of genetic alterations, thus 
allowing for their further study in isogenic 
human models of disease. 


Materials and methods summary 
Engineered melanocytes 


Genome engineering was performed on pri- 
mary human epidermal melanocytes derived 
from the foreskin of a neonatal, lightly pig- 
mented male [ThermoFisher Scientific, Cat. 
C0025C, donor 1583283; classified as nonhuman 
subjects research by Broad Institute Office of 
Research Subject Protection (ORSP-1487)], 
which were cultured at 37°C, 5% CO2, and
5% O2 in M254 medium supplemented with
HMGS-2 melanocyte growth supplement, with 
medium and cells from ThermoFisher Scien- 
tific. Cas9 protein, tracrRNA, and guides were 
purchased from IDT and prepared according 
to manufacturer instructions. Electropo- 
ration was performed with the P3 Primary Cell 
Nucleofector Kit and Nucleofector 4D System 
from Lonza. Recombinant AAV was used to 
deliver a donor DNA template for precision 
editing. 


Mouse xenografts 


Female NSG mice (4 to 6 weeks old) received 
two intradermal injections, one in each flank. 
Tumor size and body weight were assessed 
twice per week. All procedures were performed 
under Massachusetts Institute of Technology 
Committee for Animal Care protocol 0036-01-15. 


Single-cell RNA-Seq 


Cells grown in vitro were processed with 10x 
Genomics Single Cell 3’ v3, with hashing gen- 
erated as described previously (82). For in vivo 
expression profiles, single cells were dissociated 
from a cube excised from the center of each 
xenograft tumor, and cells were processed with 
10x Genomics 3’ v2. Expression programs were 
identified with cNMF as implemented in 
Kotliar et al. (36). 


Histopathology and machine learning 


Paraffin-embedded tumors were sectioned mul- 
tiple times, stained with H&E, and imaged, 
yielding 150 whole-slide images (WSIs) from 
52 blocks. The WSIs were processed to obtain
tissue patches, a fraction of which were used 
as input for a two-stage convolutional neural 
network based on previous work (59). Testing 
was performed on nontraining patches and on
melanoma patient samples from The Cancer 
Genome Atlas (TCGA), processed with the same 
pipeline. Additional and more detailed mate- 
rials and methods are available in the supple- 
mentary materials (30). 
INTRODUCTION: Before the 1800s, dogs were
probably primarily selected for functional roles 
such as hunting, guarding, and herding. Modern 
dog breeds are a recent invention defined by con- 
formation to a physical ideal and purity of lineage. 
Breeds are commonly ascribed temperaments 
and behavioral proclivities based on the purported 
function of the ancestral source population. By 
extension, the breed ancestry of individual 
dogs is assumed to be predictive of tempera- 
ment and behavior. Through our community 
science project Darwin’s Ark (darwinsark.org),
we enrolled a diverse cohort of pet dogs to
explore how genetics shapes complex behavioral 
traits in this exceptional natural model. 


RATIONALE: Dogs are a natural system for investi- 
gating the genetics of complex traits. Millions of 
pet dogs live in human homes, sharing our envi- 
ronment, and receive sophisticated medical care. 
Behavioral disorders are treated with human 
psychiatric drugs, achieving similar response 
rates, and genetic studies suggest shared etiology 
with some human psychiatric conditions. 


[Figure panel A: Biddability] How readily dog responds to human direction, especially in the context of training;
eight questions; defined by factor analysis of complete surveys for 10,253 dogs.
Heritability (1910 sequenced dogs): 30.5 ± 8.5% (±SE)
Effect of breed on behavior. (A) Biddability is among eight behavioral factors defined from surveys.
SE, standard error. (B) Dogs in some breeds tend to score unusually high or low for this factor compared
with dogs overall. (C and D) Border collies score lower on average for biddability (vertical line at median)
but vary widely (C), including genetically confirmed border collies (D). (E) In mixed-breed dogs, border collie 
ancestry has a small genetic effect on biddability. [Photo credits: K. Wirka (Sprocket); M. Wisniewski 
(Caboose); B. Hoadley (Molly); M. Logsdon (Hunter); A. Macias (Lily); S. Staples (TWooie)] 
We developed Darwin’s Ark as an open data
resource for collecting owner-reported pheno- 
types and genetic data and invited any dog 
owner to enroll their dog. We paired this with 
low-pass sequencing to capture nearly all com- 
mon variation in this outbred population. Our 
inclusive approach achieved the large samples 
needed to investigate complex traits. 


RESULTS: We surveyed owners of 18,385 dogs 
(49% purebred) and sequenced the DNA of 
2155 dogs. Most behavioral traits are heritable 
[heritability (h²) > 25%], but behavior only subtly
differentiates breeds. Breed offers little predic- 
tive value for individuals, explaining just 9% 
of variation in behavior. For more heritable, 
more breed-differentiated traits, like biddabil- 
ity (responsiveness to direction and commands), 
knowing breed ancestry can make behavioral 
predictions somewhat more accurate (see the fig- 
ure). For less heritable, less breed-differentiated 
traits, like agonistic threshold (how easily a dog 
is provoked by frightening or uncomfortable 
stimuli), breed is almost uninformative. 

We used dogs of mixed breed ancestry to test 
the genetic effect of breed ancestry on behavior 
and compared that to survey responses from 
purebred dog owners. For some traits, like bid- 
dability and border collie ancestry, we confirm a 
genetic effect of breed that aligns with survey 
responses. For others, like human sociability 
and Labrador retriever ancestry, we found no 
significant effect. 

Through genome-wide association, we found 
11 regions that are significantly associated with 
behavior, including howling frequency and hu- 
man sociability, and 136 suggestive regions. 
Regions associated with aesthetic traits are 
unusually differentiated in breeds, consistent 
with a history of selection, but those associated 
with behavior are not. 


CONCLUSION: In our ancestrally diverse cohort, 
we show that behavioral characteristics ascribed 
to modern breeds are polygenic, environmentally 
influenced, and found, at varying prevalence, in 
all breeds. We propose that behaviors perceived 
as characteristic of modern breeds derive from 
thousands of years of polygenic adaptation that 
predates breed formation, with modern breeds 
distinguished primarily by aesthetic traits. By 
embracing the full diversity of dogs—including 
purebred dogs, mixed-breed dogs, purpose-bred 
working dogs, and village dogs—we can fully 
realize dogs’ long-recognized potential as a 
natural model for genetic discovery. 


The list of author affiliations is available in the full article online. 
Behavioral genetics in dogs has focused on modern breeds, which are isolated subgroups with distinctive
physical and, purportedly, behavioral characteristics. We interrogated breed stereotypes by surveying
owners of 18,385 purebred and mixed-breed dogs and genotyping 2155 dogs. Most behavioral traits
are heritable [heritability (h²) > 25%], and admixture patterns in mixed-breed dogs reveal breed
propensities. Breed explains just 9% of behavioral variation in individuals. Genome-wide association 
analyses identify 11 loci that are significantly associated with behavior, and characteristic breed 
behaviors exhibit genetic complexity. Behavioral loci are not unusually differentiated in breeds, but breed 
propensities align, albeit weakly, with ancestral function. We propose that behaviors perceived as 
characteristic of modern breeds derive from thousands of years of polygenic adaptation that predates 
breed formation, with modern breeds distinguished primarily by aesthetic traits. 


Modern dog breeds are less than 160 years
old (~50 to 80 generations), a blink in
evolutionary history compared with the
origin of dogs more than 10,000 years
ago (1, 2) (Fig. 1A). Prehistoric wolves
likely adapted to use human refuse through 
changes in morphology, behavior, metabolism, 
and reproduction (3-7). Early humans may have 
given favored dogs increased access to limited 
resources, but there is little evidence of humans 
intentionally breeding dogs until 2000 years ago 
(8, 9). By contrast, the modern dog breed, em- 
phasizing conformation to a physical ideal and 
purity of lineage, is a Victorian invention (10).
Before the 1800s, dogs were primarily selected
for functional roles such as hunting, guarding,
and herding (11), heritable behaviors derived
from the wolf predatory sequence (12). Modern 
breeds retain these component motor patterns, 
but their contexts, sequences, and thresholds 
vary (12, 13). The extent to which ancient be-
havioral propensities persist in modern breeds,
defined primarily by aesthetics and often dis- 
connected from functional behavioral selec- 
tion, is unclear. 

Dogs with ancestry from a single modern 
breed (purebred dogs) predominate in genetic 
studies, which capitalize on their unusual pop- 
ulation history and limited genetic diversity 
(14-16), but are a minority of all dogs (3, 17). 
More than 80% of the nearly 1 billion dogs on 
Earth are free-living, free-breeding, and not 
under human control (e.g., village dogs) (3). 
Even in countries with large purebred pop- 
ulations, dogs with ancestry from more than 
one breed are common [~50% in the United 
States (18)]. Herein, we use the word “mutt”
to describe a dog with ancestry from more 
than one breed, and potentially from non- 
breed populations. 

Modern breeds are commonly ascribed char- 
acteristic temperaments (e.g., bold, affectionate, 
or trainable), and behavioral proclivities on the 
basis of their purported ancestral function (e.g., 
herding or hunting) (19, 20). By extension, the
breed ancestry of an individual dog is assumed
to be predictive of temperament and behavior
(21), with dog DNA tests marketed as tools for
learning about a dog’s personality and training 
needs (22). Studies, however, found that within- 
breed behavioral variation approaches levels 
similar to the variation between breeds (23, 24), 
suggesting that such predictions are error 
prone even in purebred dogs. 

Behavioral traits in dogs are also a poten- 
tially powerful natural model for human neu- 
ropsychiatric disease. Pet dogs are regularly 
treated with human psychiatric drugs, includ-
ing selective serotonin reuptake inhibitors,
and have similar response rates, and genetic 
studies suggest shared etiology (25-29). Dog 
behavioral traits are polygenic, driven by many 
small effect loci and the environment (30, 31). 
Given this genetic complexity, the success of 
survey-based phenotypes for mapping com- 
plex human diseases (32, 33), and the avail- 
ability of well-validated dog-owner surveys 
(34-37), a large-scale, survey-based study de- 
sign is ideal for investigating the genetics of 
canine behavior. 

Through our community science project 
Darwin’s Ark (darwinsark.org), we enrolled a 
diverse cohort of pet dogs to explore the com- 
plicated, and sometimes unexpectedly weak, 
relationship between breed and behavior. We 
show that using ancestrally diverse dog cohorts 
enables more powerful studies of behavioral 
genetics in this notable natural model. 


Results 
Survey data 


We developed Darwin’s Ark as an open data 
resource for collecting owner-reported pheno- 
types and genetic data in dogs. Dog owners were 
asked to complete 12 short surveys (117 ques- 
tions) on behavioral and physical traits (Fig. 1, 
B and C; figs. S1, S2A, and S3; and table 
S1). Darwin’s Ark surveys, each with no more
than 10 questions, are designed to be easy to 
complete. Owner survey responses are suscep- 
tible to rater bias, including the influence of 
breed stereotypes. 

To ascertain size, we asked whether the 
dog is ankle-high, calf-high, knee-high, thigh- 
high, or hip-high rather than requiring own- 
ers to measure their dog (fig. S2A). This simple 
question proved effective when validated in 
three ways: (i) We measured dogs [N = 38 dogs;
Pearson correlation coefficient (Rpearson) = 0.86;
p = 3 x 10”]; (ii) owners measured their dogs
(N = 337 dogs; Rpearson = 0.84; p = 6 x 1079);
and (iii) we tested correlation with breed aver-
age height (38) in the subset of dogs that were
purebred (N = 2025; Rpearson = 0.85; p < 2.2 x
10^-16) (Fig. 1D, fig. S2, and data S1).
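The size validation described above amounts to correlating an ordinal survey answer with a measured height. The following is a minimal sketch of that calculation, assuming the five answers are coded 1 to 5 (a hypothetical coding) and using invented heights rather than the study's data.

```python
from math import sqrt

# Hypothetical ordinal coding of the five survey answers (an assumption,
# not taken from the paper).
SIZE_CODES = {"ankle-high": 1, "calf-high": 2, "knee-high": 3,
              "thigh-high": 4, "hip-high": 5}


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Illustrative data only: owner answers and measured heights in cm.
answers = ["ankle-high", "calf-high", "knee-high", "knee-high", "hip-high"]
heights_cm = [22.0, 35.0, 48.0, 55.0, 71.0]
codes = [SIZE_CODES[a] for a in answers]
r = pearson_r(codes, heights_cm)
print(f"R_pearson = {r:.2f}")
```

Treating the ordinal answer as a number is itself a simplifying choice; a rank-based coefficient would be a reasonable alternative.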

Owners answered, on average, 100 ± 34 (±SD)
questions per dog; 70% of dogs have answers 
for more than 95% of questions (22) (Fig. 1, E 
and F). For the 48 questions drawn from the 
Dog Personality Questionnaire, between- 
question correlations matched published re- 
sults (22, 37) [Mantel’s correlation coefficient 
(r) = 0.95, p = 1 x 10”; fig. S1].

Overall, the Darwin’s Ark cohort (N = 18,385;
85% from the United States) is broadly similar
to the US dog population. Half (49.2%) are re-
ported as purebred (18), and breed frequencies
(Fig. 1G) correlate with US breed popularity 
(39) (Rpearson = 0.88; p = 1.48 x 10”) (table S2). 

To reduce the dimensionality of the sur- 
vey data, we performed exploratory factor 
Fig. 1. The ancestry-inclusive Darwin’s Ark project collected surveys and genetic
data from a diverse cohort of dogs. (A) Selection on behavior in dogs predates 
modern breeds, which have existed for ~1% of dog history (10). (B and C) Surveys 
include (B) 79 published behavior questions (34-37) and 39 new questions about 
heritable, easy-to-identify traits that fall into (C) four categories that potentially vary in 
heritability, including physical traits (fig. S3). (D) Owner responses to the size question 
(fig. S2) were highly correlated with measured size in 375 dogs (Pearson correlation). 
Boxes and whiskers represent 25% quartile, 75% quartile, minimum, and maximum, 
with horizontal line at median. (E) Owners of most dogs answered >95% of questions. 


(F) Upset plot visualizing the dataset. Six horizontal bars show the number of dogs 
subsetted by information type or breed category (“subsets”). Each column is a possible 
intersection of subsets, with black circles indicating the inclusion of a subset and 
vertical bars showing the number of dogs intersecting. Four rows (“pools”) represent
the dogs used in four major analyses. Plus signs denote the inclusion of an intersection, 
with total number of dogs (N) on the right. (G) The frequency of breeds among 
purebred dogs in Darwin's Ark (left), compared with the 14 most common breed 
ancestries we detected in all dogs through global ancestry inference (middle). More 
popular breeds tended to be guessed more frequently by MuttMix participants (right). 


analysis (22), defined eight factors that ex-
plain a cumulative 24.3% of variance in be-
havior (fig. S4 and tables S3 and S4), and
scored 16,522 dogs. We named each factor
for the facet of behavior captured and es-
tablished standard terms for describing low-
and high-scoring dogs (Fig. 2A). In subse-
quent analyses, we examined the influence
of breed and genetics on each behavioral fac-
tor (Fig. 2B).


29 April 2022


Genetic data


Nearly half of the Darwin’s Ark cohort are 
mutts, an understudied population with un- 
characterized genomic diversity. We whole- 
genome sequenced (mean coverage = 46x; 

Fig. 2. [...] positive score). (B) In a series of seven analyses, we explored how behavior relates to breed in the context of size, which is a strongly breed-differentiated trait. For each column, circle size is proportional to the minimum-maximum normalized values of (i) LD-corrected h²SNP, (ii) effect size of breed in ANOVA (confirmed breed), (iii) standard deviation of PPS (candidate breeds), (iv) standard deviation of LMER t scores, (v) −log10(minimum p) for MLMA, (vi) fraction of breeds with significant overlap (pFDR < 0.05) between PBS and GWAS, and (vii) maximum MAGMA log10(p) for 13 brain regions in GTEx (85).

Panel text recovered from Fig. 2:
Toy-directed motor patterns (toy-directed, not toy-directed): How much dog displays motor patterns (e.g., grab-bite, chase), particularly towards toys (9 questions)
#4 Biddability (biddable): How readily dog responds to human direction, especially in the context of training (8 questions)
#5 Agonistic threshold: How easily dog is provoked by a frightening, uncomfortable, or annoying stimulus (9 questions)
#6 Dog sociability (highly sociable): How comfortable dog is around other dogs, especially if unfamiliar to them (6 questions)
#7 Environmental engagement (high engage., low engage.): How interactive dog is with its familiar, daily environment (13 questions)
#8 Proximity seeking (affectionate): How readily dog solicits human contact and closeness (7 questions)
Q121: size (positive control)
Other legible labels: assertive, independent. Scale: normalized score.


range 30x to 67x) 27 pet dogs of unknown 
breed ancestry (the “Mendel’s Mutts” cohort), 
including 26 from the United States (one orig- 
inally from Mauritius and two from St. Kitts) 
and one from Ireland (data S2). We compared 
jointly called variant records in this cohort with 
published whole genomes for 530 purebred 
dogs from 128 breeds (22, 40). 

Sequencing mutts efficiently captures com- 
mon variants in the dog population, including 
variation not detected by sequencing large 
numbers of purebred dogs. Sequencing an 
additional mutt yields nearly as many new 
variants as sequencing a purebred dog, even 
when each new purebred dog is from a dif- 
ferent breed (Fig. 3A). In the 27 mutts, we dis- 
covered 11,974,853 biallelic single-nucleotide 
polymorphisms (SNPs) total, including 375,474 
variants not found in the 128 breeds (530 dogs). 
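The comparison above (how many new variants each additional genome contributes) can be illustrated with a simple accumulation count. This is a sketch under stated assumptions: each genome is represented as a set of variant identifiers, and the identifiers below are toy values, not data from the study.

```python
def new_variants_per_genome(genomes):
    """For each genome (a set of variant IDs), in the order given, count
    the variants that were not seen in any earlier genome."""
    seen = set()
    counts = []
    for variants in genomes:
        novel = set(variants) - seen
        counts.append(len(novel))
        seen |= novel
    return counts


# Toy variant IDs (hypothetical); real input would come from jointly
# called variant records.
dogs = [{"v1", "v2", "v3"}, {"v2", "v3", "v4"}, {"v1", "v5", "v6"}]
counts = new_variants_per_genome(dogs)
print(counts)
```

Because the count depends on the order in which genomes are added, a discovery curve of this kind is usually averaged over several random orderings.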

We also confirm that genetic variation private 
to a single breed is rare [38,097 ± 13,206 (±SD)
SNPs per breed after excluding Tibetan mastiffs,
a more distantly related lineage (41) with 651,551
private SNPs]. Breeds are not distinguished
by a small number of “breed-defining” variants
(table S5). After analyzing all 13 breeds with 
more than 10 sequenced representatives, we 
found just 332 SNPs (298 autosomal) exclusive
to, and fixed in, a single breed (data S3) out
of 16,702,091 SNPs total (0.002%). Tibetan
mastiffs account for 142 SNPs (121 autosomal),
with just 16 ± 31 (±SD) SNPs (15 ± 29 auto-
somal) in each of the nine other breeds. These
variants are unlikely to affect phenotype. An- 
notation with SNPEff classifies 98.2% (326) as 
occurring at loci without obvious function. 
Mendel’s Mutts have shorter runs of homo- 
zygosity than purebred dogs (Fig. 3B) and 
linkage disequilibrium (LD) that decays more 
rapidly (fig. S5). Thus, genotyping arrays de-
signed with sufficient marker density for pure-
bred dog studies miss much of the genetic
variation in mutts. Average squared correlation
coefficient (r²) between SNPs drops below 0.2 at
9.8 kb, which is slightly longer than in village dogs 
(6.2 kb) but 5- to 10-fold shorter than in breeds 
(fig. S5). Because of the short LD, the markers 
included on the Illumina Canine HD Beadchip 
(N = 171,882) and the Axiom Canine Genotyping 
Array Sets A and B (N = 1,011,992) tag only 19 and
53% of genomic variation in mutts, respectively, 
compared with 51 and 85% in breeds (Fig. 3C). 
We adopted a low-pass sequencing and im- 
putation approach (42-46), using a reference 
panel of 435 deeply sequenced dogs and other
canids (data S4). We validated by resequenc- 
ing 11 mutts with high-coverage whole-genome 
sequencing (WGS) at low coverage [1.0x ± 0.6x
(±SD)]. We imputed, on average, 32,438,672
SNPs and 13,910,371 insertions and/or deletions
(indels) per dog, or 19.8 ± 6.9 (±SD) variants per
kilobase (~40x denser than the Axiom array), 
which was sufficient to tag nearly all the com- 
mon variants (94% tagged by a marker within 
100 kb and 87% within 1 kb) (Fig. 3C). Con- 
cordance between low-pass and 30x sequencing 
was 98.3 + 0.7% (+SD) (N = 11 dogs; ~7.7 million 
common SNPs), which was slightly lower than 
that between the Axiom array and 30x (99.3 + 
0.1%; N = 10 dogs; 0.83 million SNPs) but 
better than that between imputed array calls 
and 30x (97.3 + 0.3%; 7.6 million SNPs), in- 
cluding higher concordance for heterozygous 
genotypes (98.9 versus 98.3%) (data S5). 
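Genotype concordance of the kind reported above can be sketched as the fraction of shared sites at which two call sets agree. The definition below is an assumption for illustration (the exact definition used in the study is given in its supplementary materials): genotypes are coded as alternate-allele counts, sites missing in either call set are skipped, and the site names and calls are invented.

```python
def genotype_concordance(test_calls, truth_calls, het_only=False):
    """Fraction of compared sites where the test genotype equals the truth.

    Genotypes are alternate-allele counts (0, 1, or 2); None marks a
    missing call. Sites missing in either call set are skipped. With
    het_only=True, only sites heterozygous in the truth set are compared.
    """
    matches = 0
    compared = 0
    for site, truth in truth_calls.items():
        test = test_calls.get(site)
        if truth is None or test is None:
            continue
        if het_only and truth != 1:
            continue
        compared += 1
        if test == truth:
            matches += 1
    if compared == 0:
        raise ValueError("no overlapping called sites to compare")
    return matches / compared


# Invented example: high-coverage calls as truth, imputed low-pass calls as test.
high_cov = {"chr1:100": 0, "chr1:200": 1, "chr1:300": 2,
            "chr1:400": 1, "chr1:500": None}
low_pass = {"chr1:100": 0, "chr1:200": 1, "chr1:300": 2,
            "chr1:400": 2, "chr1:500": 1}
overall = genotype_concordance(low_pass, high_cov)
het = genotype_concordance(low_pass, high_cov, het_only=True)
print(f"overall = {overall:.2f}, heterozygous = {het:.2f}")
```

Reporting heterozygous sites separately matters because homozygous-reference sites dominate the overall figure and are the easiest to call correctly.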

Our final genetic dataset comprises 1715 
Darwin’s Ark dogs sequenced at 0.6x ± 0.3x
(±SD) coverage, each genotyped for 32,213,747 ±
141,060 (±SD) SNPs, and 440 dogs genotyped
on the Axiom array and imputed using the
same pipeline for 32,006,290 ± 157,307 (±SD)
SNP genotypes, for a total of 2155 dogs (data S6). 
Fig. 3. Mutts have [...] Axiom genotyping arrays [...] sequencing with imputation, particularly in outbred populations like mutts. (D and E) Global breed ancestry inference pipeline (assessed using simulated breed admixture) (D) calls breeds comprising >5% ancestry accurately but misses lower-frequency breeds and (E) can discern [...] (~24 to 36 years). Error bars [...] deviation across 10 simulations of 100 admixed genomes. (F) For dogs categorized as confirmed purebred based on owner reports, breed calling assigns >85% ancestry (vertical dashed line) to the owner-reported breed in 90% of dogs. For candidate purebred dogs, 58% meet this criterion (4.4% had no detectable ancestry from the owner-reported breed). Just 5% of dogs categorized as mutts have >85% ancestry from their most common breed (blue dashed line). (G) Most mutts have ancestry detected (>5%) from more than three breeds (1205 dogs total). (H) Examples of breed calling in four dogs with different ancestry types: Caramel, a purebred dog, who has 93% of her ancestry assigned to her owner-reported breed; Hubble, an F1 goldendoodle; Coconut, who has apparent mutt ancestry mixed with dalmatian; and Clarence, a mutt with <45% ancestry from any one breed (*Staffordshire bull terrier). [Photo credits: M. Movassagh (Caramel); J. Luban (Hubble); A. Pensarosa (Coconut); R. Skloot (Clarence)]


We selected dogs for sequencing on the basis of 
the enrollment date and survey completion rate 
(74% of dogs had sequencing funded by the 
owner’s donation). 


Breed ancestry assignment 

In our genetic data, owner-reported breed is 
a reasonable proxy for predominant genetic 
ancestry. We developed a breed-calling pipe- 
line using the software ADMIXTURE (22, 47) 
to infer ancestry using a supervised analysis 
and a reference panel of 101 breeds (12 dogs
per breed; 688,060 SNPs) collated from public 
and Darwin’s Ark data (Fig. 3, D and E; figs. S6 
and S7A; and table S2). Genetically inferred 
breed ancestry across dogs correlated well 
with the proportions of dogs registered to 
breeds in the American Kennel Club (93 breeds, 
Rpearson = 0.74; p = 2.9 x 10°") (Fig. 1G).

The top breed that was called matched the 
owner-reported breed in 98.7% of dogs de- 
scribed as registered purebred (N = 304) and 
in 85.8% of all dogs for which owners report
just a single breed (N = 885) (table S6). Dogs 
described as registered purebreds vary some- 
what in the percent ancestry assigned to the 
owner-reported breed (potentially because the 
reference data are not representative of all 
diversity in the breed or because of shared 
ancestry between breeds). We empirically set 
the threshold for defining a dog as genetically 
“purebred” as when 85% of ancestry is inferred 
to come from a single breed because 90% of 
dogs described as registered purebred by own-
ers fell above this threshold (Fig. 3F).
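The threshold rule described above can be written as a small helper. This is a sketch under stated assumptions: the input is a mapping from breed name to inferred ancestry fraction (as a supervised ancestry analysis might report), the function and breed names are hypothetical, and the strict inequalities follow the ">85%" and ">5%" wording of the Fig. 3 caption.

```python
PUREBRED_THRESHOLD = 0.85   # threshold stated in the text
DETECTION_THRESHOLD = 0.05  # ">5% ancestry" detection level from the Fig. 3 caption


def summarize_ancestry(fractions):
    """Summarize one dog's inferred breed ancestry fractions.

    `fractions` maps breed name -> ancestry fraction (0 to 1). Returns the
    top breed, its fraction, whether the dog counts as genetically
    purebred, and the sorted list of breeds above the detection level.
    """
    if not fractions:
        raise ValueError("no ancestry fractions given")
    top_breed = max(fractions, key=fractions.get)
    top_fraction = fractions[top_breed]
    detected = sorted(b for b, f in fractions.items()
                      if f > DETECTION_THRESHOLD)
    return {
        "top_breed": top_breed,
        "top_fraction": top_fraction,
        "genetically_purebred": top_fraction > PUREBRED_THRESHOLD,
        "detected_breeds": detected,
    }


# Hypothetical dogs with placeholder breed names.
single_breed_dog = {"breed_A": 0.93, "breed_B": 0.04, "breed_C": 0.03}
mixed_dog = {"breed_A": 0.40, "breed_B": 0.35, "breed_C": 0.20, "breed_D": 0.05}
print(summarize_ancestry(single_breed_dog))
print(summarize_ancestry(mixed_dog))
```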

We designated three classifications of breed 
ancestry: (i) “confirmed purebred dogs” were 
either described as registered purebred by the 
owner or confirmed by sequencing (3637 dogs), 
(ii) “candidate purebred dogs” included all 
confirmed purebred dogs and dogs with owner- 
reported ancestry from one breed (9009 dogs), 
and (iii) “mutts” were all other dogs (9376 dogs) 
(Fig. 1F). Genetically inferred ancestry super-
seded owner-reported breed when both were
available, although discrepancies were rare 
(15 out of 556 candidate purebreds, 3 out of 
323 confirmed purebred) and were primarily 
nominal variations on the same breed (e.g., 
Landseer versus Newfoundland). Extrapolat- 
ing from the subset of dogs with genetic data, 
89.7% of registered purebred and 58.2% of dogs 
with owner-reported ancestry from one breed 
would, if sequenced, have >85% ancestry called 
from their owner-reported breed (Fig. 3F). Be- 
cause confirmed purebred dogs have a subs- 
tantially higher percentage of their ancestry 
assigned to their owner-reported breed, in sub- 
sequent analyses, we prioritized the confirmed 
set or, when the larger and more diverse can- 
didate set was useful, validated findings in the 
confirmed set. 

Mutts are rarely (17%) mixes of just two 
breeds. Most (66%) carry >5% ancestry from 
four or more breeds (Fig. 3G). We find that 
1071 dogs (70%) are highly admixed, carrying 
under 45% ancestry from any one breed (Fig. 
3H). The most common breed ancestry (data 
S7) is American pit bull terrier (9.9%) followed 
by Labrador retriever (6.0%), Chihuahua (5.1%), 
beagle (4.1%), and German shepherd dog (4.0%) 
(Fig. 1G and fig. S7C), varying by geographic 
region (fig. S7D). Purebred dogs had higher 
coefficients of inbreeding, as estimated from 
the proportion of the genome in runs of homo-
zygosity [FROH = 0.06 ± 0.04 (±SD); N = 633]
than mutts (FROH = 0.02 ± 0.02; N = 1221) [for
Student’s t test p value (pt-test) = 1.7 x 10°? t =
28.4, degrees of freedom (df) = 776.8] (fig. S7B).
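The inbreeding estimate used above, the proportion of the genome in runs of homozygosity (FROH), is a ratio of summed run lengths to genome length. A minimal sketch follows, assuming the runs have already been called and do not overlap; the segment coordinates, the genome length, and the optional minimum-length filter are all illustrative assumptions rather than the study's settings.

```python
def f_roh(roh_segments, genome_length_bp, min_length_bp=0):
    """Proportion of the genome covered by runs of homozygosity.

    `roh_segments` is an iterable of (chromosome, start_bp, end_bp) tuples
    that are assumed not to overlap. Runs shorter than `min_length_bp`
    are ignored.
    """
    if genome_length_bp <= 0:
        raise ValueError("genome length must be positive")
    total = 0
    for _chrom, start, end in roh_segments:
        length = end - start
        if length < 0:
            raise ValueError("segment end precedes start")
        if length >= min_length_bp:
            total += length
    return total / genome_length_bp


# Toy example: a 1 Mb "genome" with two homozygous runs.
segments = [("chr1", 0, 40_000), ("chr2", 10_000, 30_000)]
toy_genome_bp = 1_000_000
print(f"F_ROH = {f_roh(segments, toy_genome_bp):.2f}")
```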


Heritability of surveyed traits 


Combining genetic and survey data for 1967 
dogs, we found that genetic variation explains 
more than 25% of the variation in factor scores 
for human sociability, toy-directed motor pat- 
terns, and biddability (responsiveness to com- 
mands), as well as in responses to 38 of 83 
(46%) behavioral questions and eight physical 
traits. We estimated SNP-based heritability 
(h²SNP) with standard errors using restricted
maximum likelihood (48) and LD score cor- 
rection (excluding 27 questions for which more 
than half of LD-stratified variance components 
were constrained) on genetic relationship 
matrices calculated for dogs with genetic data 
[8,518,951 autosomal SNPs with minor allele 
frequency (MAF) >2%] (data S8). Physical traits 
are exceptionally heritable, with five out of
eight exceeding 85% heritability. Retrieving is
the most heritable behavioral trait [52.5 ± 9.2%
(±SE)], and human sociability is the most heri-
table factor (factor 1, 67.3 ± 13.0%). Behaviors
related to intrinsic motor patterns and physical 
traits are more heritable than other behaviors 
(Fig. 4A). To assess whether these heritabilities 
are overestimated because of correlation be- 
tween traits and breed ancestry, we recalculated 
them by incorporating the top 10 principal com- 
ponent eigenvectors of genetic variance (49, 50). 
For the most part, we saw little change [median 
fold change of +0.02, 25% quartile = −0.049
and 75% quartile = +0.140] between estimates 
(Rpearson = 0.97, p = 1.1 x 10°°*). Heritability 
decreases the most for biddability (factor 4, 
drops from 30.5 ± 8.5% to 20.0 ± 8.8%) and
“circles before pooping” [question 64 (Q64),
drops from 25.1 ± 8.1% to 8.0 ± 7.6%] (fig. S8).


Breed explains some behavior variance 


In the owner surveys, breed explains a larger 
fraction of the variance in behavior pheno- 
types (110 questions and eight factors) than 
size, sex, or age, but the effect is relatively 
small (Fig. 4, B and C; fig. S9A; and data S9). In 
an analysis of variance (ANOVA) of confirmed 
purebred dogs representing 78 breeds, the 
breed effect, measured as generalized eta 
squared (ges) (51), averages 0.089 ± 0.039 (±SD)
(range 0.034 to 0.253), correlates with heritability
(Rpearson = 0.89; p = 7.9 x 10”) (fig. S9B),
and is about fivefold higher for the physical 
traits characteristic of breeds than for behav- 
ioral traits (fig. S9C). The same analysis using 
the less stringent “candidate purebred” breed 
definition is nearly perfectly correlated with 
the confirmed purebred analysis (Rpearson = 
0.99, p = 5.2 x 10°; N = 125), with ges values 
~30% lower (mean ratio = 0.70 ± 0.11) (fig. S9D).
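
As a rough illustration of the effect-size measure used above: in a one-way, between-subjects design, generalized eta squared reduces to the between-group sum of squares divided by the total sum of squares. The sketch below assumes that simplified one-way case; the function name and toy scores are hypothetical, and the analysis in the text also involves size, sex, and age terms, which this example does not model.

```python
from statistics import fmean

def eta_squared_one_way(groups):
    """SS_between / SS_total for a dict mapping group label -> list of scores."""
    all_scores = [x for scores in groups.values() for x in scores]
    grand_mean = fmean(all_scores)
    ss_total = sum((x - grand_mean) ** 2 for x in all_scores)
    ss_between = sum(
        len(scores) * (fmean(scores) - grand_mean) ** 2
        for scores in groups.values()
    )
    return ss_between / ss_total

# Toy data: three hypothetical breeds with shifted means and equal spread.
toy_scores = {
    "breed_a": [1.0, 2.0, 3.0],
    "breed_b": [2.0, 3.0, 4.0],
    "breed_c": [3.0, 4.0, 5.0],
}
print(eta_squared_one_way(toy_scores))
```

A value near 0.09, as reported for behavior, would mean that most of the score variance lies within breeds rather than between them.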

Age explains little of the variation [0.018 ±
0.035 (±SD)] overall, but for a subset of traits
it exceeds 0.05, including two factors (arousal
level and toy-directed motor patterns) and
nine questions, which include five designed to
assess aging-related traits (36) (Fig. 4C and fig.
S9E). Sex has little effect (0.009 ± 0.044),
except for “lifts leg to urinate” (Q66; ges =
0.48). Size has virtually no effect (6.6 x 10°* ±
8.6 x 10°-*; range 2.5 x 10~” to 0.006).


Breed is not a reliable predictor of 
individual behavior 


For several factors, score distributions for in- 
dividual breeds differ from the distribution of 
all dogs (fig. S10), with at least a few breeds 
over- or underrepresented in the highest- 
scoring quartile (fig. S11 and data S10). These 
distributions are based on owner survey data 
that may be influenced by breed stereotypes 
and other factors, and differences are not nec- 
essarily genetic in origin. For example, for 
human sociability (factor 1), an individual 
Labrador retriever (1.4-fold), golden retriever
(1.6-fold), American pit bull terrier (1.4-fold), 
or Siberian husky (1.7-fold) was more likely to 
score in the highest quartile than a randomly 
selected dog, whereas a German shepherd dog 
(0.78-fold), Chihuahua (0.72-fold), or dachshund 
(0.56-fold) was less likely. Even so, in every 
breed represented by 25 or more dogs, the 
majority scored within one SD of the Darwin’s 
Ark cohort mean (67.2 ± 7.5% within one SD
and 95.4 ± 3.0% within two SD for confirmed
purebred dogs). Behavioral factors show high 
variability within breeds, suggesting that although
breed may affect the likelihood that a
particular behavior occurs, breed alone is not,
contrary to popular belief, informative enough
to predict an individual’s disposition.

We developed an interactive dashboard 
(https://darwinsark.org/muttomics) to illus- 
trate the value offered by breed for predicting 
behavior in any individual dog. For example, 
the chance that an owner scores an individual 
dog in the highest quartile for human socia- 
bility increases from 22% for a mutt to 40% if 
that dog is a golden retriever (fig. S11). Users 
can select one or a combination of character- 
istics, and the site dynamically updates to show 
the frequency in 23 breeds and in mutts. 
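
The fold values quoted in this section can be read as a ratio of top-quartile rates. The sketch below is a minimal illustration, assuming the quartile cutoff is taken on the full cohort so that the baseline rate is about 25%; both function names are hypothetical and the scores are made up.

```python
from statistics import quantiles

def top_quartile_fold(cohort_scores, group_scores):
    """Ratio of a group's top-quartile rate to the full cohort's rate."""
    cutoff = quantiles(cohort_scores, n=4)[2]  # 75th percentile of the cohort
    cohort_rate = sum(s > cutoff for s in cohort_scores) / len(cohort_scores)
    group_rate = sum(s > cutoff for s in group_scores) / len(group_scores)
    return group_rate / cohort_rate

def fold_from_rates(group_rate, baseline_rate=0.25):
    """Fold change when only the two rates are known."""
    return group_rate / baseline_rate

# A 40% top-quartile rate against an assumed 25% baseline.
print(fold_from_rates(0.40))
# Made-up scores: a cohort of 100 dogs and a group of five.
print(top_quartile_fold(list(range(100)), [80, 90, 10, 76, 50]))
```

Under the 25% baseline assumption, a 40% rate corresponds to a 1.6-fold ratio, in line with the fold value quoted earlier for golden retrievers.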


Measuring breed peculiarity 


We developed a permutation-based approach 
to measure when dogs of a particular breed 
are described by owners as having behavioral 
characteristics that significantly differentiate 
them from other dogs. For each phenotype, 
we compared dogs within each breed to dogs 
sampled randomly from the full cohort, pro- 
ducing a “population peculiarity score” (PPS) 
(22). We tested both “confirmed” breeds (sample
n = 50; up to six breeds) and “candidate”
breeds (sample n = 25; up to 62 breeds) (Fig. 4,
D and E; figs. S12 and S13A; and data S11). 
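
One way to picture a permutation comparison of this kind is sketched below. It is an assumption-laden illustration, not the PPS procedure itself, whose details are in the cited methods (22): the breed's mean score is compared with the means of equally sized random draws from the whole cohort and expressed as a z-score. The function name, the number of permutations, and the seed are all hypothetical choices.

```python
import random
from statistics import fmean, stdev

def permutation_z(breed_scores, cohort_scores, n_perm=1000, seed=0):
    """z-score of a breed mean against means of random same-size cohort samples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    k = len(breed_scores)
    null_means = [fmean(rng.sample(cohort_scores, k)) for _ in range(n_perm)]
    return (fmean(breed_scores) - fmean(null_means)) / stdev(null_means)

# Toy cohort of 500 scores on a 0-4 scale, with two made-up groups of 25 dogs.
cohort = [i % 5 for i in range(500)]
typical_breed = [0, 1, 2, 3, 4] * 5
extreme_breed = [4] * 25
print(permutation_z(typical_breed, cohort))
print(permutation_z(extreme_breed, cohort))
```

Sampling the null distribution at the breed's own sample size matters here, because smaller groups produce more variable means and would otherwise look more extreme.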

Overall, breeds were only subtly differ- 
entiated on behavioral phenotypes. In the 
confirmed purebreds, only 5.1% (30/583) of 
breed-phenotype pairs were significantly dif- 
ferentiated for behavioral questions, compared 
with 41.5% (17/41 pairs) for physical traits. 
Scores for behavioral questions were not more 
correlated with each other than were scores for 
physical questions (table S7). Intrinsic motor
patterns and physical traits tend to be slightly 
more breed differentiated (fig. S13B). 

No behaviors are exclusive to any breed (fig. 
S14). Even in the breed with the lowest howling 
propensity, confirmed Labrador retrievers (Q17; 
N = 241; 78.4% never howl), 8% of owners 
report that their Labrador howls sometimes, 
often, or always. Although 90% (53/57) of con- 
firmed greyhounds are reported to never bury 
their toys (Q29), owners described three dogs 
as frequent buriers. 

We used the same permutation approach
to measure how behavior changes as dogs age
(fig. S13C). Most questions (72%) and factors
(63%) are correlated with age [false discovery
rate (FDR) p value (pFDR) < 0.05] (figs.
S15 and S16). Older dogs score nearly as composed
in their arousal level (factor 2) as the
most composed breed (Great Pyrenees), and
puppies are far more toy-directed in their
motor patterns (factor 3) than one of the
most toy-directed breeds (German shepherd
dog) (Fig. 4, F and G).

Fig. 4. Breed and age (and not size) have subtle effects on the behavior traits surveyed.
(A) LD-corrected SNP-based heritability is much higher for physical traits, and somewhat higher
for motor patterns, compared with other behavioral traits (significance measured using Student's
t test; Benjamini-Hochberg (BH)-adjusted p value is shown). Shaded regions indicate the probability
density. (B and C) ANOVA in confirmed purebred dogs shows that (B) the effect size of breed
exceeds 15% for 6/7 physical traits and some behavioral questions (labeled bars) and that (C) for
factors, breed explains more variation in scores than age and sex, and size has no significant
effects. In (C), asterisks indicate statistical significance (BH-adjusted p < 0.05). (D and E)
Permutations comparing breeds to randomly sampled dogs show that (D) many breeds are significantly
differentiated on physical traits but that differentiation is much rarer for behavioral traits,
including intrinsic motor patterns and (E) the eight behavioral factors. The selection of the most
popular and/or most differentiated breeds is shown, with full results in fig. S12. Breeds in italics
are represented by fewer than 50 individuals and skew toward more extreme Z-scores (gray background;
difference in mean score = 0.29; pt-test = 2 x 10). (F) The effect of age, compared with the effect
of breed, in the ANOVA shows that age explains nearly as much variance as breed for factor 2.
(G) The PPS for arousal level and toy-directed motor patterns are significantly correlated with age,
whereas biddability is more breed driven. Asterisks indicate significant results.




Testing breed stereotypes 

PPSs aligned, to a limited degree, with behavioral
stereotypes described in the breed
standards (data S12). The American Kennel Club
(AKC) describes each breed with a three-word
phrase (e.g., border collies are “affectionate,
smart, energetic” and beagles are “friendly,
curious, merry”) (52) (table S2). Breeds described
with particular words are not behaviorally
distinct from other breeds; however,
“charming” tends to describe breeds that are
less toy directed (factor 3; pFDR = 0.039) (fig.
S17). Grouping breeds by their proposed historic
working role, as captured by AKC show
groups (53), finds that four out of six show
groups are peculiar on at least one factor
(Fig. 5A). Herding breeds are more toy directed,
more biddable, more engaged, and more aloof,
whereas toy breeds are more independent and
less dog social. Sporting breeds are more toy
directed, and working breeds are more dog
social (Fig. 5B and fig. S18A).

We found more support for breed behav- 
ioral stereotypes when comparing the PPS 
results to quantitative rankings from the 
Encyclopedia of Dog Breeds for each breed 
on 10 behavioral characteristics (19) (fig. 
S17B). Nine of the 10 correlated significantly 
with PPS for at least one factor. Breeds ranked 
high on “ease of training” tended to be more 
biddable (factor 4) and more toy directed 
(factor 3). Breeds ranked as low on “energy 
level” scored as more composed, more dog 
social, and less environmentally engaged 
(factors 2, 6, and 7). “Watchdog ability” and 
“friendliness towards strangers” both corre- 
lated with human sociability (factor 1), but in 
opposite directions. 

Overall, when comparing breeds to all pet 
dogs, breed differences based on owner re- 
ports align with some breed behavioral ste- 
reotypes, with one major caveat. Using survey 
data alone, we cannot distinguish environ- 
mental effects, including the effects of the 
stereotypes themselves (e.g., by influencing 
owner’s perception of their dog’s behavior), 
from genetic effects. 


Human perception of breed in mutts 


Half of the Darwin’s Ark cohort were mutts, 
offering an opportunity to test whether breed 
stereotypes have a genetic etiology. In pure- 
bred dogs, cultural breed stereotypes affect the 
perception of a dog’s behavior and thus may 
alter a dog’s environment (54, 55) or introduce 
rater bias into owner survey responses. If breed 
ancestry is not readily discernible in mutts, 
these nongenetic factors would be mitigated, 
allowing us to discern the genetic effects of 
breed from human perception and other envi- 
ronmental factors. 

To measure how accurately breed can be 
discerned from physical characteristics in 
mutts, we ran a 2-month community science 
project, MuttMix (muttmix.org) that recruited 
26,639 participants (Fig. 5C). For 30 mutts 
with complex genetic ancestry (i.e., no first- 
generation crosses) (22), we asked participants 
to guess their top three breeds. Between 
13,662 and 14,160 participants submitted responses
for each dog. They accurately identified,
on average, 20.9 ± 20.4% (±SD) of each
dog’s breed ancestry (fig. S19A). Breeds comprising
a smaller proportion of a dog’s ancestry
were especially challenging to identify
(Rpearson = 0.61; p = 3.3 x 107") (fig. S19B).
Thus, any effect of perceived ancestry on 
survey responses is likely to be substantially 
mitigated in mutts. 

The physical characteristics associated 
with a breed, like short fur (American pit bull 
terrier), short legs (dachshund), or pricked 
ears (Chihuahua), influenced how participants 
guessed, but this is an error-prone approach 
(Fig. 5, D to F, and fig. S19C). Dogs with an- 
cestry from more popular breeds had more 
correct guesses because participants tended 
to guess popular breeds more frequently (with 
the exception of the underguessed American 
pit bull terrier) (Fig. 1G and fig. S20). To control 
for this, we calculated how often we would 
expect to see each possible combination of 
breed guesses if breed guess rates matched 
the population frequencies in Darwin’s Ark 
(table S2) and compared the observed rate of 
correct guesses to the expected rate. Partici- 
pants guessed correctly more often than ex- 
pected for 73% of dogs (Fig. 5C and fig. S21A). 
The number of correctly guessed breeds by 
each participant for each dog was slightly 
higher for self-described dog professionals 
[N = 84,918; mean = 0.93 (SD 0.74)] than
nonprofessionals [N = 333,614; mean = 0.81 
(SD 0.72)] (fig. S21B). 


Effect of breed ancestry in mutts 


We measured whether breed influences be- 
havior through genetics by examining only 
mutts with <45% ancestry from any single 
breed. This analysis illustrates the power of in- 
cluding complex mixes in behavioral studies, 
because any influence of breed stereotypes is 
mitigated when true breed ancestry is not read- 
ily discernible from appearance. We built linear 
mixed-effects regression (LMER) models for 
all factors and questions (breed ancestries as 
fixed effects; age and pairwise genetic relatedness
as random effects) (22) (Fig. 6, A to C; fig.
S22; and data S13). The proportion of variance
in factor scores captured by genetic breed
ancestry (marginal R²) averaged 9 ± 3% (±SD),
suggesting a weak but discernible genetic
effect of breed on disposition (fig. S23B and
data S14). Breed ancestry explained, on average,
20 ± 12% (±SD) of variance for physical
traits in mutts.

We validated the LMER method by confirm- 
ing that the ancestry effects for physical traits 
matched the breed standards for physical 
appearance (38). For example, ancestry from 
nine breeds (six with long fur and three with 
short fur) had a significant effect on fur length 
in mutts, and for each breed, the direction of 
effect matches the breed standard. In total,
we assessed 51 breed-trait pairs for four traits 
(size, white coat color, ear shape, and fur length). 
The direction of effect matched in 50 out of 
51 (table S8). For the size question (Q121), the 
LMER score is strongly correlated with the 
breed average height (Rpearson = 0.86; p = 2.3 x 
10°°; N = 19). 

Correlation between the LMER results and
the PPSs confirms that some behavioral differences
in mutts derive from differences in
breed ancestry (N = 6333; Rpearson = 0.28; p =
1.8 x 10°) (fig. S24 and data S15). For example,
mutts with more border collie ancestry
tend to be more biddable (factor 4; LMER t =
-4.6; pFDR = 0.0002), consistent with survey
data for confirmed border collies [PPS z =
-4.6; corrected p value (pcorr) = 2 x 10°°].
Similarly, mutts with more Labrador retriever
ancestry tend not to avoid getting wet
(Q60; LMER t = 3.8; pFDR = 0.003), like many
confirmed Labrador retrievers (PPS z = 4.3;
pcorr = 0.003).

Discordance between the LMER results and
breed differentiation measured by PPS may
capture nongenetic influences on survey responses
such as breed stereotypes. Owners
of confirmed golden retrievers, for example,
tend to disagree that their dog is fearful of
unfamiliar people (Q46; PPS z = 4.6; pcorr =
0.002), which fits the breed stereotype that
golden retrievers are friendly to strangers (19).
In mutts, however, golden retriever ancestry
had no effect on this question (LMER t = 0.3;
pFDR = 0.88), suggesting that the reported
propensity may not be driven by genetics (fig.
S23A). Similarly, whereas owners of confirmed
Labrador retrievers tend to describe their dogs
as more human social (factor 1; PPS z = 3.3;
pcorr = 0.006), in line with the breed stereotype
(“friendly, active, and outgoing”), in mutts,
Labrador retriever ancestry has little effect
(LMER t = 0.45; pFDR = 0.90). Owners of confirmed
border collies tend to score their dogs
higher on “wants to play” (Q2; PPS z = -3.6;
pcorr = 0.04); this is consistent with the
stereotype that border collies are “affectionate,
smart, and energetic” but discordant
with the LMER results, which find no effect
of border collie ancestry (LMER t = 0.089;
pFDR = 0.97).

By analyzing the effect of ancestry on behavior
in mutts, we can anticipate the likely
behavioral propensities of breeds that are not
well represented in our survey data (table S9).
For example, Saint Bernard ancestry correlates
with being more affectionate (factor 8;
LMER t = -4.1; pFDR = 0.002) and Shar-Pei
ancestry with being less toy directed (factor 3;
LMER t = 4.6; pFDR = 0.0002) (Fig. 6B). Ancestry
from Chesapeake Bay retrievers is associated
with a greater propensity to damage doors (Q40; LMER
t = 4.2; p = 0.001) and to escape from enclosures
(Q35; LMER t = 3.5; pFDR = 0.02).
Fig. 5. Breed stereotypes can be assessed in mutts, where environmental effects (e.g., owner
perception) are mitigated by the difficulty of accurately discerning the breed. (A) Breeds grouped
by purported historic working roles are more differentiated (measured by mean PPS) on some factors
than other breeds (58 breeds total), measured as a t statistic using Student's t test. Error bars
represent 95% confidence intervals. (B) Toy and herding breeds illustrate groups with shifts in
opposite directions (full results in fig. S18). Points are PPS for breed, vertical lines are mean
PPS for group, boxes enclose the 25 to 75% quartiles, and horizontal lines extend from 1.5 times the
interquartile range (IQR) below the 25% quartile to 1.5 times the IQR above the 75% quartile. Arrows
indicate the direction of change in means, and words show the favored behavioral propensity. Herding
breeds are Australian cattle dog, Australian shepherd, Belgian malinois, border collie, Catahoula
leopard dog, collie, German shepherd dog, Pembroke Welsh corgi, and Shetland sheepdog. Toy breeds
are bichon frise, Cavalier King Charles spaniel, Chihuahua, Havanese, Maltese, miniature pinscher,
papillon, Pomeranian, pug, shih tzu, toy poodle, and Yorkshire terrier. (C) For each dog in the
MuttMix survey, the ratio of observed:expected correct guesses for (i) one or more or (ii) two or
more of the highest-content breed ancestries (blue indicates a ratio >1; open circles are not
significant). (D) For six dogs that have a similar amount of genetic ancestry detected from American
pit bull terrier (~25 to 30%), participants guessed this breed at rates ranging from 1 to 60%.
[Photo credits: J. O'Donnell (Jack); T. Fortier (Rosie); A. Phelps (Reilly); L. Moses (Rudy);
R. Skloot (Clarence); M. Bishop (Esme)] (E) For three individual mutts, the most guessed breeds
(top) differ from the genetically inferred breed ancestry (bottom). [Photo credits: E. Winchester
(Maxine);

Morrill et al., Science 376, eabk0639 (2022) 29 April 2022


Genome-wide association studies in all dogs 

We first investigated breed-defining physical 
traits with known large-effect loci using a mixed 
linear model-based approach for genome- 
wide association (56) across 8,518,951 SNPs 
of >2% MAF. We controlled for population 
and family structure and cryptic relatedness 
in our complex cohort (600 purebred dogs, 
representing 88 breeds, and 1496 mutts) using 
a genetic relationship matrix in a mixed- 
model framework. None of the genome-wide 
association studies (GWASs) had unusual genomic
inflation [mean inflation factor (λGC0.5) =
0.985 ± 0.016 (±SD); range 0.960 to 1.03; N =
14], suggesting that the mixed-model framework
controls for confounding due to population
structure and other factors (57).

We successfully replicated 17 published associations
for physical traits other than size
(table S10), including for genes MITF (14)
with white spotting [p = 2.89 x 107°”; SNP effect
size (b) = -0.78], FGF5 and RSPO2 (58)
with coat length (p = 5.46 x 107°"; b = +0.37)
and texture (p = 6.35 x 10°°; b = +0.11), USH2A
(59) with roan and/or ticking (p = 5.31 x 10°";
b = +0.20), RUNX3 (60) with pheomelanin intensity
(p = 4.11 x 1078; b = -0.20), and the
β-defensin region (61-63) with brindle coat
patterns (p = 2.50 x 10°; b = +0.35) (fig. S25,
D to I, and data S16).

For size, a quantitative trait, we replicated
10 previously published associations (40, 64-71)
(Fig. 6D, fig. S25A, and table S10) and found
new associations to SAR1B [p = 2.01 x 10°°; b =
+0.12; metabolic disorders (72, 73)] and ANAPC1
[p = 4.11 x 107°; b = +0.15; short stature in
Rothmund-Thomson syndrome (74)]. By comparing
giant dogs (N = 55) and then tiny dogs
(N = 55) to average dogs (N = 1841), we distinguished
variants associated with gigantism
(fig. S25B) and dwarfism (fig. S25C)
specifically. The FGF4 retrogene locus, previously
associated with chondrodysplasia (67),
is more strongly associated in the tiny GWAS
(pall = 1.16 x 107°; ptiny = 1.15 x 10°°°), dwarfing
all other loci.

The height associations were robust even
in the absence of purebred dogs, suggesting
that the all-mutt GWAS might offer equivalent
power to one that includes purebred dogs. In
dogs carrying less than 45% ancestry from any
breed, a cohort with about half as many dogs
(970 versus 1951), we identified all the major
stature-associated loci (Rpearson = 0.91; p <
1 x 10-8) as well as a new association in LRIG3
(p = 7.29 x 10 *°; b = -0.31), a gene involved in
bone morphogenetic protein-mediated body-size
regulation (75) (fig. S25J).

Genomic predictions for height based on 
the GWAS-identified variants perform well 
in both purebred dogs and mutts, reflecting 
the strong selection on size among dog breeds. 
For a random forest regression model built 
using 1730 dogs and 2733 size-associated SNPs 
(p < 1x 10°°) (22), predictions carried a mean 
squared error of 0.3 (fig. S26) and 66% of predictions
fell within ±0.5 units of the relative
size score (fig. S2) (Rpearson = 0.77, p = 3.90 x
10-°"), with no drop in accuracy for predictions
made on mutts [predicted and true values
differed by 0.46 ± 0.35 (±SD) in purebreds
versus 0.43 ± 0.36 (±SD) in mutts; pt-test =
0.08, t = 1.75, df = 832]. Randomly selected
SNPs, by comparison, performed poorly, with
a mean squared error of 0.5 (45% of predictions
within ±0.5 units). Predictions for relative
stature validated well against more precise 
measurements taken in person (N = 310 dogs; 
Rpearson = 0.91, p = 8.8 x 10°") (Fig. 6E and 
fig. S27). 


Behavioral GWASs 


Applying the same GWAS approach to the
behavioral phenotypes identified 11 genome-wide
significant (p < 5 x 10⁻⁸) (76) and 136 suggestive
(p < 1 x 10⁻⁶) associations (data S16).
As with physical traits, the behavioral GWAS
had minimal genomic inflation [mean λGC0.5 =
0.995 ± 0.0087 (±SD); range 0.976 to 1.05; N =
118]. The associations for behavioral traits were
weaker, consistent with a more complex genetic
architecture. They have not yet been independently
replicated. The most significant
association, to “gets stuck behind objects”
(Q36), mapped to a 380-kb region (p = 8.36 x
10°"; b = +0.54) (Fig. 6F and fig. S25K) containing
SNX29, a gene associated with cognitive
performance in human GWASs (77-79).
“Dog howls” (Q17) mapped to an intergenic
region (p = 9.63 x 10°"; b = +0.54) (Fig. 6G and
fig. S25L) between SLC38A11 and SCN3A, a
voltage-gated sodium channel involved in the
development of speech and language (80). The
top association to a behavioral factor was for
human sociability (factor 1), downstream of
the gene HACD1 (p = 2.41 x 107°; b = -0.36)
(Fig. 6H and fig. S25M), a regulator of long-term
memory (81) that is also associated with
centronuclear myopathies (82).

In our diverse cohort with dense genotyping
data, associated regions are smaller than
those discovered using intrabreed GWASs
with sparser marker sets. We compared our
behavior-associated regions to those found in
an earlier study of a different complex trait
(osteosarcoma) at the same linkage threshold
(r² > 0.8). In the Darwin’s Ark GWAS, associated
regions extend to a median 5.6 kb (25 to
75% quartile = 2.0 to 14 kb, mean 16.8 kb)
around suggestive (p < 1 x 10⁻⁶) behavioral
loci and 5.7 kb at physical trait loci (1.4 to 22 kb,
mean 26.2 kb). By contrast, intrabreed GWASs
of osteosarcoma in three breeds with diverse
population structures mapped at median ranges
of 86 kb (25 to 75% quartile = 57 to 162 kb) in
racing greyhounds, 54 kb (21 to 409 kb) in
rottweilers, and 1 Mb (743 kb to 1.4 Mb) in Irish
wolfhounds (83). This increased resolution may
facilitate the search for causal variants. In the
Darwin’s Ark GWAS, we can distinguish a
region on chromosome 10 that is associated
with stature (76.2 kb at r² > 0.8; HMGA2; p =
1.84 x 10°**; b = -0.31) from one associated
with ear shape (118.7 kb at r² > 0.8; MSRB3;
p = 6.02 x 10-7; b = -0.33) that were previously
linked in interbreed GWASs (71, 84)
(fig. S28).

R. Bacon (Jack); E. Stackpole (Bella)] (F) The dogs in (E) illustrate how a mutt's physical
characteristics influence participant breed guesses. Points show entropy explained by traits using
guesses for all mutts (22), and bars span values from a leave-one-out analysis (full results in
fig. S18). For example, 67% of participants likely guessed Irish wolfhound for Maxine because of
her coat furnishings.

The mixed-model association approach may
not fully control for spurious association that
arises when a trait differs between breeds. An
association for “focused in distracting situations”
(Q21) (chr32:4,512,005; p = 1.0 x 10°°;
b = -0.22) (fig. S25N) mapped to a locus containing
FGF5, a gene associated with long-coated
breeds (58). This association was lost
when we conditioned on the top coat length-associated
SNP (chr32:4,509,367; p = 0.0001;
b = -0.15) (fig. S29), which is linked to the top
focus-associated SNP (r² = 0.33). The original
association likely reflected the spurious difference
in focus scores between dogs with shorter
and longer coats (pt-test = 0.00023; 2012 dogs).
Pleiotropy is unlikely because fur length explains
almost no variation in focus scores
(ANOVA ges = 0.0004; p = 0.35; N = 2456).
Consistent with this, the focus association
on chromosome 32 weakens (chr32:4,512,005;
p =12 10°; b = -0.18) when we include the
top 10 SNP-based principal components in the
mixed model (fig. S30).

To assess whether spurious breed-trait correlations
are a major confounder in our analyses,
we reran all GWASs and included the top
10 principal components in the mixed model.
Only 6% (3/48) of our genome-wide significant
associations were lost (p > 1 x 10-°) (data S16).
We also tested whether the top 75 regions associated
with dog size (a highly breed-differentiated
trait) were enriched for SNPs associated in
any of the 119 behavioral GWASs (fig. S31).
Only one GWAS (Q66, “lifts leg to urinate”)
was significant [adjusted p value (padj) = 0.013].
Thus, although spurious associations due to
aesthetic traits are a concern in multibreed
GWASs, they are likely rare in the GWAS run
on our diverse cohort.

Fig. 6. Genetics of aesthetic and behavioral traits in dogs and influence of breed ancestry in
mutts. (A to C) In highly admixed dogs with no breed ancestry over 45%, the fixed effects of breed
ancestry on (A) physical traits, (B) behavioral factor scores, and (C) individual behavioral
question scores are shown. (D) Manhattan plot for the GWAS of surveyed height (Q121) from 1951 dogs,
including covariates for age and sex. Linkage blocks (r² > 0.2) associated (p <5 10°) with stature
align with previous associations for body size in (a) IGF1R (68), (b) LCORL (40), (c) GHR (69),
(d) SMAD2 (84), (e) HMGA2 (69) and the nearby (f) MSRB3, (g) a chromosome 12 retrogene insertion of
FGF4 (70), (h) IGF1 (64-66), (i) another FGF4 retrogene on chromosome 18 (67), (j) MED13L (40), and
(k) IGF2BP2 (40). Two previously unknown associations were found spanning JADE2 and SAR1B, and in
ANAPC1. (E) Random forest models based on size-associated SNPs (p < 1x 10°) accurately predict body
size and correlate strongly (N = 310 dogs; Rpearson = 0.91, p = 8.8 x 10", t = 37.451, df = 308)
with real measurements in those dogs. (F to H) Regional association plots for (F) scores on Q36
“gets stuck behind objects,” (G) human sociability, and (H) frequency of howling from Q17. In
addition to protein coding genes (black boxes), we also show representative open chromatin regions
(rOCRs; narrow vertical lines). We annotated rOCRs genome-wide using ENCODE methods (101) applied to
canine ATAC-seq (assay for transposase-accessible chromatin using sequencing) data from 14 tissues
(102) and mammalian sequence constraint (103). (I) Breeds show high genetic differentiation,
measured as the population branch statistic, overlapping physical trait associated loci compared
with ~100,000 randomly permuted regions (N = 1232, mean z = 0.49, p =7.3 x 10°). Regions associated
with behavioral factors (N = 512, mean z = 0.03, p = 0.224) and question scores (N = 9317, mean
z = 0.00, p = 0.603) do not show such differentiation in breeds. Red circles indicate mean, with
horizontal lines at the 25% quartile, median, and 75% quartile. The shaded area is the probability
density, with significant differentiation in red. ns, not significant; ****p < 0.0001 (Student's
t test).


Unattributed heritability


A large proportion of the genetic and environmental
contributions to behavior remains
undiscovered. The SNPs that we found to be
associated with heritable (h²SNP > 0.1) behavioral
traits account for a smaller proportion
of overall heritability than do aesthetic trait
associations (22), consistent with a complex
genetic architecture. For the 14 physical traits,
53.0 ± 30.2% (±SD) of heritability is attributable
to associated SNPs (p < 1 x 10°), but for
the eight behavioral factors and 73 questions,
this drops to 21.0 ± 12.8% and 27.9 ± 20%,
respectively. The six associated loci accounted
for 42.7% of the genetic component of dog
sociability (h²SNP = 14.8 ± 6.1%), whereas just
4.3% of highly heritable human sociability
(h²SNP = 41.5 ± 9.1%) could be explained by
its single associated region.


Brain-expressed genes enriched in 
behavior GWASs 


Regions associated with dog behavioral phe- 
notypes are enriched in brain-expressed genes. 
We cataloged the genes expressed in 38 tissue 
types, including 13 brain regions, using human 
GTEx data (85), an approach used previously 
in dogs (86). We also collated genes from 
curated lists for obsessive-compulsive disorder 
(OCD) (87), autism-spectrum disorders (88), 
and schizophrenia (89, 90). Using MAGMA 
(22, 91), we tested all GWASs for enrichment 
(data S17). Regions associated with toy-directed 
motor patterns (factor 3) had the strongest 
enrichments, which were for genes expressed 
in the hippocampus and in the basal ganglia of 
the nucleus accumbens, caudate, and putamen. 
Associations for “not keen on new situations” 
(Q84) were enriched for hypothalamus- 
expressed genes (fig. S30). Overall, enrichments 
in genes associated with neuropsychiatric con- 
ditions were weak, peaking for the enrichment 
of human OCD genes in Q84-associated regions 
(p = 0.0012; padj = 0.24).


Aesthetic selection predominates in breeds 


Associations to physical traits, but not be- 
haviors, tend to overlap signals of genetic dif- 
ferentiation in modern breeds, suggesting 
that aesthetics, and not behavior, has been the 
focus of selection. We tested for sites with ex- 
cess differentiation in each breed with >12 dogs 
using the population branch statistic (PBS) 
test (92), using all dogs (N = 3802 to 3878) and
wolves (N = 48) as the two outgroups across
~27.6 million SNPs from publicly available and
our genetic data (data S4 and S6). Among
the top 0.1% of breed-differentiated regions
(26 ± 6 regions per breed), we validated genetic
signals of selection reported at EPAS1,
for hypoxia tolerance, in Tibetan mastiffs (93);
at CACNA1A, unknown phenotype, in two sled
dog breeds (94); at ESR1, unknown phenotype,
in long-legged sighthounds (40); and at
ALX4, a blue eye color gene, in Siberian huskies 
(95) (data S18). 
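As a rough illustration of the PBS test, the Python sketch below computes a population branch statistic for a breed from three pairwise F_ST values (breed versus all other dogs, breed versus wolves, and dogs versus wolves), using the standard log-transformed branch length T = −ln(1 − F_ST). The function names, the clipping constant `eps`, and the sample-size-corrected ratio-of-averages form of the Hudson estimator are assumptions made for this example; the normalization and sliding windows used in the study are not reproduced here.

```python
import math


def hudson_fst(p1, p2, n1, n2):
    """Hudson F_ST as a ratio of averages over the SNPs in one window.

    p1, p2: per-SNP allele frequencies in the two populations.
    n1, n2: number of sampled chromosomes in each population (assumed > 1).
    """
    num = den = 0.0
    for a, b in zip(p1, p2):
        num += (a - b) ** 2 - a * (1 - a) / (n1 - 1) - b * (1 - b) / (n2 - 1)
        den += a * (1 - b) + b * (1 - a)
    return num / den if den > 0 else 0.0


def branch_length(fst, eps=1e-9):
    """T = -ln(1 - F_ST), with F_ST clipped to [0, 1 - eps] (an assumption)."""
    fst = min(max(fst, 0.0), 1.0 - eps)
    return -math.log(1.0 - fst)


def pbs(fst_breed_dogs, fst_breed_wolves, fst_dogs_wolves):
    """Branch length specific to the breed in the three-population tree."""
    return (
        branch_length(fst_breed_dogs)
        + branch_length(fst_breed_wolves)
        - branch_length(fst_dogs_wolves)
    ) / 2.0
```

A window in which the breed differs strongly from both outgroups, while the outgroups resemble each other, receives a high score; for example, `pbs(0.5, 0.5, 0.0)` equals ln 2.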

We used permutation (22) to test whether 
PBS scores are unexpectedly high in regions 
associated with traits (data S19) and found 
that, whereas physical trait-associated regions 
are more differentiated, those associated with 
behavioral traits are not (mean z = 0.491 versus
−0.001; p_t-test = 4 x 10) (Fig. 6I). Considering
all moderately associated GWAS regions (p <
1 × 10⁻⁶), 25 of 65 (39%) physical trait loci are
unusually differentiated, whereas only 38 of 
515 (7%) behavioral trait loci are, and a sub- 
set of those are also connected to physical 
traits (data S20). Differentiation at physical 
trait loci is consistent with ongoing selection 
to meet strict morphometric standards in 
breeds (38), and the lack of overlap for most 
behavioral traits suggests weaker or absent 
selection. 

The lack of differentiation at behavioral loci 
is not inconsistent with heritable behavioral 
differences in breeds, which may reflect ge- 
netic drift or selection that predates breed 
creation, neither of which the PBS test is 
designed to detect. To this point, neither of 
the two loci associated with howling are dif- 
ferentiated in either the Siberian huskies or 
beagles, even though ancestry from these 
breeds influences howling propensity. 


Discussion 


Behavioral traits are subtly differentiated in 
modern breeds (Fig. 2B). Furthermore, breed 
offers only modest value for predicting the 
behavior of individual dogs. For more herita- 
ble and more breed-differentiated traits, like 
biddability (factor 4), knowing breed ancestry 
can make behavioral predictions somewhat 
more accurate in purebred dogs. For less her- 
itable, less breed-differentiated traits, like 
agonistic threshold (factor 5), which measures 
how easily a dog is provoked by frightening, 
uncomfortable, or annoying stimuli, breed is 
almost uninformative. 

In our ancestrally diverse cohort, we show 
that behavioral characteristics ascribed to 
modern breeds are polygenic, environmen- 
tally influenced, and found, at varying preva- 
lence, in all breeds. They likely naturally arose 
over millennia as dogs followed human mi- 
grations and adapted to new human technol- 
ogies (2). The tight bottlenecks that established 
modern breeds captured ancient variation, at 
varying frequencies, with subsequent genetic 
drift or selection further shaping modern breeds 
(Fig. 3, A and B). 

We found no evidence that the behavioral 
tendencies in breeds reflect intentional selection
by breeders (Fig. 6I) but cannot exclude
the possibility. Current datasets are too small 
to detect more subtle, recent directional selec- 
tion, which requires hundreds of thousands 
of samples (96). In dogs, breed demographic 
history makes detecting selection particularly 
challenging (1, 97).

Canine behavioral disorders are a proposed 
natural model for human neuropsychiatric dis- 
eases (25, 27). Here, we show that large-scale 
behavior GWASs in dogs are tractable, identi- 
fying dozens of loci associated with behavioral 
traits in dogs. These associations explain a 
fraction of overall heritability, suggesting that
still-larger sample sizes are needed. Our study
design, combining owner-engagement with 
low-pass sequencing (45), makes this eminently 
achievable. We anticipate that this approach 
will be even more powerful once methods for 
accurately assigning local ancestry in indi- 
viduals with >100 potential source popula- 
tions (compared with two or three in human 
studies) are validated and incorporated into 
dog GWASs (98). 

As dog studies grow in scale and complexity, 
it is crucial that we meet the standards of sta- 
tistical rigor developed by the human genetics 
community and carefully account for con- 
founding by artificial selection for aesthetic 
extremes in modern breeds (99), which can 
create misleading signals of association. One 
approach for studying behavior in dogs has 
been to compare breeds, rather than indi- 
viduals, using breed-level behavioral pheno- 
types. The wide variability in behavior within 
breeds, and the potential for spurious correla- 
tions with breed-defining aesthetic traits, sug- 
gests that any discoveries made using this 
approach should be carefully validated using 
other methods. 

To date, dog genetics has focused on mod- 
ern breeds, which capture just a tiny fraction 
of global canine diversity. Although this made 
early genomic studies feasible (14), it limits 
discovery today (100). By embracing the full
diversity of dogs, including purebred dogs, 
mutts, purpose-bred working dogs, and vil- 
lage dogs, we can fully realize dogs’ long- 
recognized potential as a natural model for 
genetic discovery. 


Materials and methods summary 


Materials and methods described in full de- 
tail can be found in the supplementary mate- 
rials (22). 


Survey data collection 


We collected consent, profile information, and 
surveys for 18,385 dogs enrolled by their own- 
ers via the Darwin’s Ark platform (https:// 
darwinsark.org) on or before 15 November 2019. 
Profile information included the dog’s approx- 
imate birth date, sex and sterilization status, 
suspected or known breed(s), purebred registra- 
tion, and/or photograph. We collected 12 sur- 
veys, including 11 about behavior (10 questions 
each) and one about physical characteristics 
(eight questions), for a total of 118 survey items 
(table S1). All responses to survey questions 
were time stamped, and ages at the time of 
survey were calculated relative to reported 
birth date (22). 

The 110 behavioral questions all used a five- 
point Likert scale: (i) 81 questions had options 
of strongly agree, agree, neither agree nor 
disagree, disagree, or strongly disagree; and 
(ii) 29 had options of never, rarely, sometimes,
often, or always. We sourced 79 behavioral
questions from published and validated surveys:
(i) Dog Personality Questionnaire (DPQ/
DPQL; 45 questions) (37); (ii) Canine Health- 
related Quality of Life Survey (CHQLS; 11 ques- 
tions) (36); (iii) Dog Impulsivity Assessment 
Scale (DIAS; 18 questions, including one also 
in DPQ) (34); and (iv) Canine Cognitive Dys- 
function Rating scale (CCDR; six questions) 
(35). We validated the performance of be- 
havioral surveys using a Mantel’s test on the 
inter-item correlation distance (d = 1 - |r|) 
matrices between published data for 48 DPQ
items (N = 2556 dogs) and our data. We included
31 new behavior questions developed
with input from canine behavior professionals 
in the International Association of Animal 
Behavior Consultants. The physical character- 
istics survey used a variety of response types 
(table S1). Answers of “I’m not sure,” “I don’t 
know,” “not sure,” and “surgically cropped ears” 
(Q125) were excluded. 
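The survey validation above compares two inter-item distance matrices with a Mantel test. As an illustration only, the following Python sketch builds the d = 1 − |r| matrix from item responses and runs a simple permutation-based Mantel test. The function names (`pearson`, `corr_distance_matrix`, `mantel`), the use of a Pearson statistic on the upper triangles, and the permutation count are assumptions made for this example and are not taken from the study's code.

```python
import random


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def corr_distance_matrix(items):
    """Inter-item distance d = 1 - |r|; `items` holds one response list per item."""
    k = len(items)
    return [
        [0.0 if i == j else 1 - abs(pearson(items[i], items[j])) for j in range(k)]
        for i in range(k)
    ]


def upper(matrix, order):
    """Upper-triangle entries after relabeling rows and columns by `order`."""
    k = len(order)
    return [matrix[order[i]][order[j]] for i in range(k) for j in range(i + 1, k)]


def mantel(d1, d2, n_perm=999, seed=0):
    """Matrix correlation and one-sided permutation p value."""
    rng = random.Random(seed)
    ident = list(range(len(d1)))
    observed = pearson(upper(d1, ident), upper(d2, ident))
    hits = 0
    for _ in range(n_perm):
        perm = ident[:]
        rng.shuffle(perm)  # relabel the items of the second matrix
        if pearson(upper(d1, ident), upper(d2, perm)) >= observed:
            hits += 1
    return observed, (hits + 1) / (n_perm + 1)
```

Permuting item labels, rather than individual matrix cells, keeps the dependence among entries that share an item, which is the reason a Mantel test is used for distance matrices.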

Dog size was measured through Q121: “When 
DOG is standing next to someone of average 
height, how high are HIS shoulders?” This 
question was validated in three ways (fig. S2 
and data S1): (i) owners were provided with 
a measuring tape by mail and instructed to 
measure the height from their dog’s shoulder 
to the ground using the provided measuring 
tape (337 dogs); (ii) dogs were measured
(height to withers) by professionals during the 
2017 Somerville Dog Festival in Somerville, 
MA (38 dogs); and (iii) owner-reported size 
was compared with average breed height 
(2025 purebred dogs). 

We performed exploratory factor analysis 
on the behavioral surveys (10,253 dogs with 
responses for all 110 questions) and extracted 
the optimal number of factors as estimated by 
the Horn’s parallel analysis and optimal coordinates
heuristic methods (20 factors; table S3).
A varimax orthogonal rotation was applied to 
generate a structure matrix with factor loadings
for each item, and items with low pattern
or structure loadings (less than ±0.3)
were removed. We generated factor scores for 
6269 additional dogs with responses to >80% 
of questions by populating missing responses 
through random sampling. The dog’s age for 
each factor is the mean age for all responses to 
included questions. 


Sample collection 


Animal study protocols for saliva and blood 
collection from dogs were approved by the 
UMass Chan Medical School Institutional 
Animal Care and Use Committee (IACUC) 
(no. A-2520-18). We sent or gave owners saliva 
collection kits (DNA Genotek PG-100 saliva 
swabs) for sampling. For a subset of dogs, 
owners provided blood collected by their 
veterinarian. We selected dogs for sequencing 
primarily based on survey completeness and 
enrollment date. Of 1715 samples submitted
for low-coverage DNA sequencing, 159 samples
(7.4% of 2155 dogs in the genetic dataset) were 
funded by owner donations to the Darwin’s 
Ark Foundation, a 501(c)(3) nonprofit orga- 
nization (82-3942341). 


High-coverage genome sequencing and analysis 


We performed high-coverage [45× ± 10×
(±SD)] WGS on samples from 27 putatively
mixed-breed dogs (the Mendel’s Mutts cohort)
(data S2). For the initial 22 mutts sequenced,
we performed joint variant calling with pub- 
licly available data for 620 other dogs and 
34 canids (data S4) using the Genome Analysis 
Toolkit (GATK3) (22) on the CanFam3.1 refer- 
ence assembly. The final variant call file con- 
tained 34,191,821 SNPs and 11,943,064 indels. 
For the five mutts sequenced later, genotypes 
were called for the same set of variants using 
GATK3 HaplotypeCaller. 

We compared cumulative variant discovery 
using purebred versus mutt genomes using 
chromosome 13 as a random proxy for the 
whole genome. We tested six cohorts: one dog 
sampled at random per breed (N = 128 possible 
dogs), Mendel’s Mutts (N = 27 dogs), and the
four breeds with >27 individuals sequenced 
(22). We computed the cumulative distribu- 
tion of the fraction of 619,031 variants dis- 
covered using 557 purebred dogs versus using 
10 dogs randomly chosen and ordered with- 
in each cohort and computed the 95% confi- 
dence interval using random reordering within 
each cohort. 

We compared the lengths of detected runs 
of homozygosity (ROH) in mutts, dog breeds, 
and village dog genomes across biallelic SNPs 
using PLINK v1.90b6.21 with a minimum length 
of 100 kb and 100 SNPs, with at least 1 kb 
per SNP (22). We then randomly sampled n =
464 runs (the mean number of ROH detected 
per mutt) from the pool of ROH detected in 
mutts, purebred dogs, and village dogs, re- 
sampling N = 100 times. 

We measured LD in mixed-breed dogs 
(Mendel’s Mutts), breeds (golden retriever, 
Labrador retriever, Leonberger, and Yorkshire 
terrier), village dogs, and wolves by randomly 
sampling 25 dogs from each cohort and, for 
20,000 randomly sampled biallelic SNPs, 
measuring r² to all SNPs within 100 kb. We
assessed tagging of genetic variation using
genotyping arrays by measuring r² between
the same set of random SNPs and the sub- 
set of SNPs on the array (171,882 for the 
Illumina HD Canine Genotyping Array and 
1,011,992 for the Axiom Canine Genotyping 
Array Sets A and B). 
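The LD measurement described above can be sketched as follows. This example assumes unphased genotypes coded as 0/1/2 allele dosages and uses the squared Pearson correlation of dosages as the r² estimate, which is a common approximation; the text does not state which estimator was used, and the function names are hypothetical.

```python
def r_squared(g1, g2):
    """Squared Pearson correlation between two genotype dosage vectors (0/1/2)."""
    n = len(g1)
    m1, m2 = sum(g1) / n, sum(g2) / n
    cov = sum((a - m1) * (b - m2) for a, b in zip(g1, g2))
    v1 = sum((a - m1) ** 2 for a in g1)
    v2 = sum((b - m2) ** 2 for b in g2)
    if v1 == 0 or v2 == 0:
        return 0.0  # assumption: report 0 when either SNP is monomorphic in the sample
    return cov * cov / (v1 * v2)


def ld_to_neighbors(index, positions, genotypes, window_bp=100_000):
    """r^2 between SNP `index` and every other SNP within `window_bp` base pairs."""
    out = {}
    for j, pos in enumerate(positions):
        if j != index and abs(pos - positions[index]) <= window_bp:
            out[j] = r_squared(genotypes[index], genotypes[j])
    return out
```

Here `positions` holds base-pair coordinates on one chromosome and `genotypes[j]` holds the dosages of SNP `j` across the sampled dogs.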


Low-coverage sequencing and imputation 


We piloted a low-pass sequencing and im- 
putation approach (42-46) using a panel of 
reference haplotypes from high-coverage 
whole-genome sequences. Autosomal variant
calls were inferred directly from sequencing
reads through Gencove loimpute software 
(46) and a panel of reference haplotypes 
from publicly available WGS data [mean cov- 
erage 22.9x (SD 14.2x)] for 435 canids (data 
S4). The imputation process generated un- 
filtered genotypes for 32,438,672 SNPs and 
13,910,371 indels with imputation genotype 
probability (GP) scores per genotype per dog. 
We validated performance by comparing 
low-pass sequencing and imputation [1.0× ±
0.6× (±SD)] to array data (Axiom array) and
high-coverage WGS data for 11 mutts with 
high-coverage WGS at low coverage. We also 
performed down-sampling of high-coverage 
WGS and subsequent imputation by the same 
method. 

We combined low-pass sequencing data for 
1715 dogs [0.6× ± 0.3× (±SD)] with data for
440 dogs genotyped on the Axiom array and 
imputed using the same haplotype reference 
panel (excluding genotypes of GP < 0.7). Af- 
ter merging, we performed additional quality 
control based on MAF, call rate, and Hardy- 
Weinberg equilibrium and validated owner- 
reported sex (22). The final dataset included 
8,518,951 biallelic, autosomal SNPs and 2155 
dogs at a genotyping rate of 97.5% (1084 males 
and 1071 females). 


Breed ancestry assignment 


We assembled a reference panel of 101 of the 
most common dog breeds in the United States 
(table S2) using high-coverage WGS for 380 dogs 
of 74 breeds (data S4), low-coverage WGS for 
115 dogs of 54 breeds, Axiom genotyping array 
data for 109 dogs of 43 breeds, and Illumina 
CanineHD arrays for 883 dogs of 90 breeds 
(22). For each breed, we selected 12 dogs for 
inclusion, prioritizing high-density raw data 
and genetic diversity within breeds. We im- 
puted genotypes for low-density data using 
the 435-canid panel of reference haplotypes. 
We retained SNPs genotyped in more than 
80% of dogs and at a MAF of at least 5%. 
Among ancestry-informative SNPs of Hudson’s 
estimator of fixation index (F_ST) > 0.15 between
breeds, we selected a dense set of
2,468,442 markers (r² > 0.9 within 5 kb) for
admixture simulations and a sparser set of
688,060 markers (r² > 0.5 within 50 kb) for
ancestry inference. 

We used a Monte Carlo approach to gen- 
erate simulated admixed genomes of known 
ancestral haplotypes and then compared the 
breed ancestry composition with ancestry in- 
ferred using ADMIXTURE (22). We simulated 
admixed individuals through N = 15 generations
of admixtures with the following procedure:
N + 1 random individuals from different
breeds were selected to contribute to the ad- 
mixture. With each iteration, recombination 
was simulated to incorporate a new individual.
Recombination was treated as a Poisson
event that occurred, on average, once every
Morgan. Simulations ran on 10 independently 
drawn datasets of six dogs per reference breed 
to create 1000 admixed individuals of known 
ancestry. We inferred global ancestry for sim- 
ulated individuals using the supervised mode 
of ADMIXTURE (random seed = 43) and the 
reference genotypes from six dogs reserved 
from each breed. 
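The recombination step of the admixture simulation can be illustrated with a short Python sketch. It treats crossovers along a chromosome as a Poisson process with a rate of one per Morgan, so the gaps between breakpoints are exponentially distributed, and then builds a mosaic of two haplotypes. The function names, the representation of haplotypes as allele lists with marker positions in Morgans, and the random choice of the starting haplotype are assumptions made for this example; the iterative addition of N + 1 individuals is not shown.

```python
import random


def crossover_positions(length_morgans, rng):
    """Breakpoints of a Poisson process with rate 1 per Morgan."""
    positions = []
    pos = rng.expovariate(1.0)  # exponential gap to the first crossover
    while pos < length_morgans:
        positions.append(pos)
        pos += rng.expovariate(1.0)
    return positions


def recombine(hap_a, hap_b, marker_pos, length_morgans, rng):
    """Mosaic of two haplotypes, switching source at each crossover.

    marker_pos: genetic positions (Morgans) of the markers, sorted ascending.
    """
    breaks = crossover_positions(length_morgans, rng)
    use_a = rng.random() < 0.5  # assumption: starting haplotype chosen at random
    mosaic, i = [], 0
    for allele_a, allele_b, pos in zip(hap_a, hap_b, marker_pos):
        while i < len(breaks) and breaks[i] <= pos:
            use_a = not use_a
            i += 1
        mosaic.append(allele_a if use_a else allele_b)
    return mosaic
```

Under this model the number of crossovers on a chromosome of genetic length L Morgans is Poisson distributed with mean L.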

We then performed supervised admixture 
analysis of the Darwin’s Ark genetic cohort. 
Genotype data from all query dogs was merged 
with all reference-breed data and filtered 
for SNPs in the global breed ancestry panel. 
Global ancestry from the 101 reference breeds 
was inferred using the supervised mode of
ADMIXTURE (random seed = 43) that was 
supplied with reference population assign- 
ments. Population weights less than 1% were 
discarded from individual ancestry results. 

We combined breed ancestry assignments 
with survey data for dogs without genetic 
data to define three breed sets as described in
the results: confirmed purebred dogs, candi- 
date purebred dogs, and mutts. 


Heritability analysis 

We estimated the SNP-based heritability 
(h²_SNP) of surveyed traits using restricted
maximum likelihood (REML) analysis imple- 
mented in the genome-wide complex trait 
analysis (GCTA, version 1.92.3 beta 3) software 
tool (56). We calculated LD scores in 250-kb 
regions using a block size of 10,000 kb with 
an overlap of 5000 kb between blocks. We 
generated a genetic relationship matrix (GRM) 
for the genetic cohort of 2155 dogs, as well as 
multiple GRMs calculated from SNPs strati- 
fied into LD score quartiles (22). The four LD- 
stratified GRMs were used to run REML 
analysis (GREML-LDMS) and estimate h²_SNP
with standard errors (data S8). 


Population peculiarity scoring 


We applied a custom permutation-based anal- 
ysis (22) to test whether groups of dogs de- 
fined by breed or age differed significantly in 
survey responses from randomly sampled 
groups on any survey item or factor. We in- 
cluded all dogs with any survey responses. 
For each permutation and a given sample 
size N (table S14), we calculated the mean 
(the observed test statistic) for each normal- 
ized survey response or factor score for N dogs 
sampled from among dogs of each grouping. 
For each permutation, we also calculated the 
mean for a random sampling of size N from 
the full dataset (the permuted test statis- 
tics). We counted how often the observed 
test statistics for each population were higher 
than the permuted test statistics. We ran a 
total of 500,000 permutations. To obtain the 
PPSs, we calculated the one-tailed empirical 
p values and generated z-scores matching
the survey directionality. We also calculated
the two-tailed p values corrected for multiple 
testing by a maxT procedure that preserves 
the correlational structure between survey 
items (22). 
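The permutation scheme above can be sketched in a few lines of Python for a single survey item. In each permutation the mean of N dogs drawn from the group is compared with the mean of N dogs drawn from the full dataset, and the fraction of permutations in which the group scores higher is converted to a one-tailed empirical p value and a z-score. The function name, the smoothing that keeps p strictly between 0 and 1, and the default permutation count are assumptions made for this example; the adjustment for survey directionality and the maxT correction are omitted.

```python
import random
from statistics import NormalDist, mean


def peculiarity(group, everyone, n, n_perm=10_000, seed=0):
    """One-tailed empirical p value and z-score for 'group scores higher'."""
    rng = random.Random(seed)
    higher = 0
    for _ in range(n_perm):
        observed = mean(rng.sample(group, n))     # N dogs from the group
        permuted = mean(rng.sample(everyone, n))  # N dogs from the full dataset
        if observed > permuted:
            higher += 1
    # Assumption: smooth the counts so that p stays strictly inside (0, 1)
    # and the normal quantile below is always finite.
    p_high = (n_perm - higher + 1) / (n_perm + 2)
    z = NormalDist().inv_cdf(1 - p_high)
    return p_high, z
```

A group that does not differ from the full dataset yields p near 0.5 and z near 0, whereas a consistently higher-scoring group yields a small p and a positive z.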


Ancestry perception survey 


We designed the web-based MuttMix survey 
(muttmix.org) to assess perceptions of breed 
ancestry in mixed-breed dogs by nonowner 
observers. Participants self-identified as either 
general public or dog professional (yes or 
no to “Do you work with dogs professionally 
and/or are you a breeder?”). The survey con- 
sisted of 30 mixed-breed dogs with ancestry 
assignments and one purebred dog. Owners 
provided front and side photographs and a 
short video. Owners reported the dog’s rela- 
tive size (fig. S2F) and other physical descrip- 
tors. The images and information that were 
provided were shared with participants, who 
were asked to guess, for each dog, the three 
breeds detected in largest proportion (22). 
The survey launched on 16 April 2018 and 
closed on 16 June 2018, and responses were 
collected from 26,639 people over a 2-month 
period. 

We compared breed guesses to genetically 
inferred breed ancestry (22). Any breed call 
below 5% was removed and only breeds 
offered as survey options were examined. To 
calculate the average total percentage of an- 
cestry guessed correctly, we first calculated the 
percentage guessed correctly by each user for 
each dog by summing the percent genetic 
ancestry attributed to their top three breed 
guesses. To assess the accuracy of user guesses 
of breed ancestry, we first counted the number 
of breed guesses for a given dog that were 
among the top two or three breeds that were 
genetically detected. 
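As a worked example of the "percentage of ancestry guessed correctly" calculation, the Python sketch below sums the genetic ancestry attributed to a participant's top three breed guesses after removing breed calls below 5%. The function name, the dictionary layout, and the example breeds and percentages are hypothetical; the restriction to breeds offered as survey options is not shown.

```python
def percent_ancestry_guessed(guesses, ancestry, min_call=5.0):
    """Sum the ancestry (in percent) attributed to up to three guessed breeds.

    guesses:  breed names chosen by one participant for one dog.
    ancestry: mapping of breed name to genetically inferred percentage.
    """
    called = {breed: pct for breed, pct in ancestry.items() if pct >= min_call}
    # A breed guessed more than once is only counted once.
    return sum(called.get(breed, 0.0) for breed in set(guesses[:3]))


# Hypothetical dog: the 4% call falls below the 5% cutoff and is ignored.
example_ancestry = {
    "labrador retriever": 50.0,
    "poodle": 30.0,
    "beagle": 16.0,
    "chihuahua": 4.0,
}
example_score = percent_ancestry_guessed(
    ["poodle", "chihuahua", "boxer"], example_ancestry
)
```

In this example only the poodle guess matches a retained breed call, so the participant is credited with 30% of the dog's ancestry.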

We measured how specific physical attrib- 
utes affected participants’ breed choices using 
entropy analysis (22). For each breed option, 
we calculated how well mutts’ phenotypes, 
defined binarily for each of eight different 
traits (height, leg length, ear type, coat type, 
coat length, coat furnishings, white spotting, 
and pigmentation), distinguished between 
participant guesses of presence versus absence 
of each ancestry. We applied a leave-one-out 
analysis, omitting guesses for each mutt in 
series, to assess the impact of guesses for each 
mutt on entropy reduction. To calculate sig- 
nificance, we randomized trait assignments 
across mutts and then asked whether entropy 
reductions from true traits were greater than 
those randomly assigned. 

We calculated how often we expected to see 
each possible combination of breed guesses by 
chance, assuming the guess rate for each breed 
to be the overall frequency of that breed (22) 
(table S2). We then calculated the observed 
rate of guesses with 1+, 2+, and 3 breeds correct
for each dog and then calculated the ratio
of the observed-to-expected rate. 


LMER models 


To measure the relationship of genetic breed 
ancestry to physical and behavioral pheno- 
types, we constructed LMER models using all 
dogs with <45% ancestry from any single breed 
(1002 dogs total). We treated normalized question
and factor scores as dependent variables,
breed ancestry as fixed effects, and age
as a random effect. For each survey item and
factor, we built a model with REML to obtain 
unbiased estimates, standard deviations, and 
Wald statistics (t.val) for the fixed effects of 
breed on factor score and performed ANOVA 
to obtain the breed F statistics. To obtain the 
likelihood ratio for each breed, we constructed 
models using maximum likelihood with and 
without the breed and performed an ANOVA. 


GWASs using mixed linear models 


We performed genome-wide mixed linear 
model-based associations in the Darwin’s 
Ark genetic cohort using the “leave-one- 
chromosome-out” approach (MLMA-LOCO) 
implemented in GCTA (56) with categorical 
covariates for sex and data type (genotyping 
or low-pass sequencing) and quantitative co- 
variates for height and age for nonmorpho- 
logical traits. Because LD is nearly as short 
in diverse dogs as in humans, we used the 
thresholds for genome-wide significance (p =
5 × 10⁻⁸) and suggestive associations (p = 1 ×
10⁻⁶) that are conventionally used in human
GWASs (1, 76).

We defined regions of association by clumping
SNPs in LD (r² > 0.2 and r² > 0.5) and near
(<250 kb) associated index SNPs using PLINK 
(data S16). When comparing region sizes to the 
earlier osteosarcoma study (83), we used the 
same clumping thresholds. To assess how much 
phenotypic variance was explained by associ- 
ated regions, we derived genetic relationship 
matrices for regions of suggestive association 
(p = 1 × 10⁻⁶) with each trait and the set of all
other SNPs and estimated the partitioned heri- 
tability as the proportion of total heritability 
unattributed by discovered associations. 

We built a predictive model for height as 
responses to survey Q121 for 1730 dogs older 
than 18 months and assessed its power through 
10-fold cross validation (9/10 training, 1/10 test- 
ing). At each round, we performed GWASs on 
the training set, selected SNPs for prediction 
at given p value cutoffs, built a random forest 
regression model, and assessed accuracy using 
the testing set. The reported accuracy and mean 
squared error are averaged across 10 rounds. 

We tested for enrichment of association 
summary statistics in three types of gene sets 
(22) by applying MAGMA (version 1.09) (91), a 
method that accounts for region size, variant 
count, and LD (data S17).


Genetic differentiation of breeds


We calculated genome-wide normalized PBS 
scores using the Hudson estimator of fixation 
(F_ST) for each breed (N_B ≥ 12 dogs, maximum
88) relative to dogs overall (N_D = 3890 − N_B)
and wolves (N_W = 48) in sliding windows of
100 kb by 10 kb (data S18) over ~27.6 million 
SNPs from publicly available and Darwin’s 
Ark genetic data (22). After dividing locus tests 
into physical trait, behavioral question, and 
behavioral factor associations, we performed 
a one-tailed Student’s t test to test whether the
observed maximum PBS within associated loci 
exceeded what we expect by random chance 
(data S20). To test whether allele frequencies 
at SNPs associated with behavioral or physi- 
cal traits tended to differ more in breeds, we 
calculated the max F_ST observed between one
of the top 10 breeds and all other dogs and 
compared this to 29,903 randomly sampled 
SNPs using a one-sided t test.


Cancer cells use self-inflicted DNA breaks to evade
growth limits imposed by genotoxic stress 


Brian D. Larsen, Jan Benada, Philip Yuk Kwong Yung, Ryan A. V. Bell, George Pappas,
Vaclav Urban, Johanna K. Ahlskog, Tia T. Kuo, Pavel Janscak, Lynn A. Megeney,
Simon J. Elsasser, Jiri Bartek, Claus S. Sorensen


Genotoxic therapy such as radiation serves as a frontline cancer treatment, yet acquired resistance 
that leads to tumor reoccurrence is frequent. We found that cancer cells maintain viability during 
irradiation by reversibly increasing genome-wide DNA breaks, thereby limiting premature mitotic 
progression. We identify caspase-activated DNase (CAD) as the nuclease inflicting these de novo DNA 
lesions at defined loci, which are in proximity to chromatin-modifying CCCTC-binding factor (CTCF) 
sites. CAD nuclease activity is governed through phosphorylation by DNA damage response kinases, 
independent of caspase activity. In turn, loss of CAD activity impairs cell fate decisions, rendering cancer 
cells vulnerable to radiation-induced DNA double-strand breaks. Our observations highlight a cancer- 
selective survival adaptation, whereby tumor cells deploy regulated DNA breaks to delimit the 
detrimental effects of therapy-evoked DNA damage.


Genotoxic cancer therapy inactivates and
kills cancer cells by inflicting extensive
DNA damage. Radiation therapy (RT)
is the most broadly applied genotoxic
challenge in standard-of-care oncological
treatment. The deposition of energy as
radiation passes through the genetic material 
triggers extensive DNA lesions often in the 
form of double-strand breaks (DSBs), single-stranded
breaks (SSBs), and interstrand crosslinks
(1). The extent of this damage can present
an insurmountable barrier to cellular fitness, 
triggering cell death or cell cycle withdrawal. 
Facilitating the DNA damage response (DDR) 
and lesion repair while avoiding cell death and 
cell cycle blockage is critical for cell survival 
after RT. Clinically, resistance to RT remains 
a considerable obstacle to effective tumor con- 
trol, as the cancer cells deploy an arsenal of 
mechanisms, still incompletely understood, to 
mitigate the effects of RT (2). 

Irradiated normal cells halt progression in 
the G1 phase of the cell cycle by activation of
p53 and pRB, which are key factors regulating 
cell cycle checkpoints. However, these factors 
are commonly inactivated in solid cancers,
leading to G1 checkpoint deficiency. Combined
with oncogene-driven premature S-phase entry, 
this scenario evokes replication stress and en- 
hanced chromosomal damage that requires 
efficient repair should the cell yield viable 
progenies when it divides (2). Hence, cancer 
cells particularly rely on the G2 cell cycle checkpoint,
preventing entry into mitosis with unrepaired
DNA breaks (3). In addition, studies
following the dynamics of RT-induced DNA 
lesions have indicated the presence of a tem- 
porally distinct and unexplained secondary 
wave of DNA breaks (4). To identify potential 
nuclease regulators of the G2 cell cycle checkpoint
after radiation, we screened a library
targeting the known human nucleases in can- 
cer cells (fig. S1) (5). The primary candidate 
emerging from this screen was RBBP8 (CtIP), 
an established DDR and G2 checkpoint factor
(6). Unexpectedly, a second robust target from 
this screen was DFFB (also known as caspase- 
activated DNase or CAD), a factor previously 
unassociated with DDR or cell cycle checkpoint 
control. CAD is the nuclease implicated in DNA 
fragmentation during apoptotic cell death as 
the effector of caspase signaling cascades (7, 8). 
Caspase-mediated cleavage of CAD’s chaper- 
one and inhibitor, ICAD (inhibitor of CAD), 
facilitates the dimerization of CAD, giving rise 
to the hallmark DNA fragmentation seen in 
apoptosis (8). 

CAD-inflicted DNA breaks and the ensuing 
DDR signals have also been implicated as in- 
ductive cues for a number of nonapoptotic cell 
fate states and transitions (9-15). Intriguingly, 
RT-induced DNA lesions encompass a tempo- 
rally distinct and mechanistically unexplained 
secondary wave of DNA breaks (4). Together, 
these observations led us to hypothesize that 
CAD might be responsible for inflicting these
delayed post-irradiation DNA breaks to exert 
checkpoint control, potentially providing a 
mechanism of radioresistance. 


CAD promotes a wave of endogenous DNA 
breaks after exogenous DNA damage 


To address the nature of the endogenous DNA 
breaks that have been reported to appear after 
exposure to ionizing radiation (IR), we ir- 
radiated human wild-type and CAD null (KO) 
colorectal cancer-derived HCT116 cells and 
measured the extent of DNA damage by alka- 
line single-cell gel electrophoresis (Comet assay) 
(Fig. 1A and fig. S2A). We did not observe any 
initial differences in DNA lesion accumulation 
between wild-type and CAD KO cells after ex- 
posure to IR. Further, the progressive reduc- 
tion in DNA damage burden through active 
DNA repair was comparable between wild-type 
and CAD KO cells. However, consistent with 
recent observations (4), we detected a sec- 
ondary accumulation of DNA lesions in wild- 
type cells that was prominent 24 hours after 
IR (Fig. 1A). The extent of DNA breakage was 
dependent on IR dosage (Fig. 1B). DNA break 
accumulation at 24 hours was largely depen- 
dent on the nuclease activity of CAD (Fig. 1C). 
These CAD-dependent breaks were observed 
in a panel of human malignant cell lines, but 
not in cells of nonmalignant origin (Fig. 1D 
and fig. S2B). To support this finding, we used 
in situ nick translation (ISNT) to quantify DNA 
breaks (15). This approach revealed a similar 
CAD-dependent elevation in DNA breaks at 
24 hours after IR (Fig. 1E). CAD is normally
present in a protein complex with ICAD, which 
also acts as a chaperone required to properly 
fold CAD (8). Accordingly, we recapitulated 
our observations in cells lacking ICAD, where 
the expression of CAD is lost (fig. S2, C and D). 
Further, extensive DNA DSB formation could 
not account for the prevalent CAD-dependent 
DNA breaks seen by the alkaline approach, 
although a discrete population of DNA DSBs 
might exist (fig. S2E). 

PARP-1 [poly(ADP-ribose)polymerase-1] 
serves as a sensor of DNA lesions that triggers 
DNA repair and was previously implicated in 
the repair of DNA breaks after IR (4, 16). Con- 
sistent with the appearance of DNA breaks at 
24 hours after IR, we observed active ongoing 
DNA repair signaling, as evident from elevated 
poly(ADP-ribose) quantities (fig. S3A). Inhibi- 
tion of PARP with 4-ANI, a strong enzymatic- 
activity inhibitor with only weak PARP-trapping 
activity, led to an elevation in DNA breaks 
even when the inhibitor was added for the last 
2 hours of the 24-hour period after IR (fig. S3B) 
(17). The elevated PARP activity and increased 
DNA breaks upon PARP inhibition were both 
dependent on CAD (fig. S3, A and B).

The extent of single-stranded DNA (ssDNA) 
at 24 hours after IR was also monitored by
native bromodeoxyuridine (BrdU) and replication
protein A (RPA) foci formation. Both
approaches demonstrated elevated numbers 
of ssDNA foci in the control irradiated cells 
but not in cells deficient for CAD/ICAD at this 
time point (fig. S3, C to H). In support of this 
observation, RPA foci formation at 24 hours 
could be restored by transient expression of 
wild-type CAD but not a nuclease-dead (ND) 
CAD variant (fig. S3I). 

We postulated that elevation of DNA break 
quantities could signal a delayed chromatin 
response after IR. Consistent with this hypoth- 
esis, the maintenance of KAP1 phosphorylation, 
a chromatin marker of ongoing DDR, was 
dependent on CAD (fig. S4, A to C) (18). In
addition to IR, we have observed a similar CAD- 
dependent signaling after doxorubicin-induced 
genotoxic damage, which also inflicts DNA 
DSBs (fig. S4, D to F). Collectively, these ob- 
servations suggest that CAD nuclease induces 
self-inflicted DNA breaks in cancer cells. Ca- 
nonically, caspase-3-mediated cleavage of ICAD 
releases CAD from inhibition, allowing for CAD 
dimerization. This positions each nuclease cleft 
in parallel across a DNA double-strand segment. 
Each nuclease cleft creates a single-strand 
break in DNA, and the two breaks together produce 
a DSB (8). Notably, CAD can also inflict DNA 
nicks during early apoptosis and skeletal muscle 
differentiation (19, 20). However, it has 
been reported that IR exposure of solid tumor 
cells does not elicit a robust caspase response 
(21). Here, we did not observe evidence of 
caspase-3 activation or proteolytic process- 
ing of ICAD after IR. Further, inhibition of 
pan-caspase activity did not affect the ob- 
served induction of DNA breaks, suggesting 
a noncanonical activation of CAD after IR (fig. 
S4, C, G, H, and I). 


Chromatin recruitment of CAD and ICAD 


We observed that both CAD and ICAD were 
recruited to the chromatin fraction of IR-treated 
cells (Fig. 2A and fig. S5A). ICAD interaction 
with CAD typically limits recruitment of the 
nuclease to DNA (22). However, the nuclease 
cleft of CAD is exposed in the CAD/ICAD 
heterodimer; thus, chromatin interactions 
could produce DNA nicks (22). To address 
whether chromatin recruitment of CAD/ICAD 
was sufficient to induce DNA breaks, we 
used the chromatin tethering model of U2OS 
263 cells, which harbor an integrated LacO array 
(23). Expressed mCherry-LacR-ICAD was cor- 
rectly recruited to the LacO array and could 
recruit a green fluorescent protein (GFP)- 
tagged CAD to this site (fig. S5B). Analysis 
of the mCherry-LacR-ICAD tethered arrays 
in comparison to the empty mCherry-LacR 
construct revealed an induction of DNA breaks, 
as characterized by the creation of 3’-OH DNA 
ends that could be readily detected by terminal 
deoxynucleotidyl transferase end labeling 

Fig. 1. CAD promotes self-inflicted DNA breaks after IR. (A) Alkaline Comet assay of HCT116 wild-type 
(WT) and CAD KO cells upon 8 Gy of IR; a representative dataset is presented. Data are means ± SEM; 
N = 3, n > 100. ****P < 0.0001 (Kruskal-Wallis multiple-comparisons test); ns, not significant. (B) Alkaline 
Comet assay of HCT116 WT and CAD KO cells upon 2 or 8 Gy of IR; a representative dataset is presented. 
Data are means ± SEM; N = 3, n > 100. *P = 0.0147, ****P < 0.0001 (Kruskal-Wallis multiple-comparisons test). 
(C) WT CAD, but not a ND CAD, restores DNA breaks as measured by the Alkaline Comet assay 24 hours 
after 8 Gy of IR; a representative dataset is presented. Data are means ± SEM; N = 3, n > 100. 
****P < 0.0001 (Kruskal-Wallis multiple-comparisons test). (D) DNA break quantities 24 hours after 
8 Gy of IR in cancer cell lines HCT116, U2OS, DLD-1, and SW480 and in noncancer cell lines RPE1 and 
Tig3 in control (UNC) or CAD-depleted cells (siCAD); a representative dataset is presented. Data are 
means ± SEM; N = 3, n > 100. *P = 0.0337, **P = 0.074, ***P = 0.0001, ****P < 0.0001 (Kruskal-Wallis 
multiple-comparisons test). (E) Relative ethynyl-deoxyuridine triphosphate incorporation into DNA breaks 
using ISNT in U2OS (control or CAD-depleted) cells upon 6 Gy of IR; a representative dataset is presented. 
Scale bar, 10 µm. Data are means ± SEM; N = 3, n > 30. ***P < 0.001 (unpaired Student's t test). 

Fig. 2. CAD/ICAD chromatin recruitment inflicts DNA breaks at defined 
genomic elements. (A) Immunoblotting of chromatin fraction of HCT116 cells 
upon 8 Gy of IR. Recruitment of p53 was used as a positive control; H3 was used as 
a loading control. (B) TUNEL end labeling of 3'-OH indicates the formation of 
DNA breaks in mCherry-LacR-ICAD transfected cells. Scale bar, 10 µm. Data are 
means ± SEM; N = 3, n > 50. **P < 0.005 (unpaired Student's t test). (C) Knockdown 
of CAD in mCherry-LacR-ICAD transfected cells reduces RPA and γ-H2AX. N = 3, 
n > 20. ***P < 0.001 (unpaired Student's t test). RFI, relative fluorescence intensity. 
(D) Genome-wide landscape of SSBs in HCT116 WT or CAD KO cells, before, 
20 min after, and 24 hours after 8 Gy of IR. Average GLOE-seq read densities 
combined from three independent replicates are summarized over functionally 
distinct genomic regions as defined by a 10-state ChromHMM genome annotation. 
Elong, elongation; Enh, enhancer; Hetero, heterochromatin; Txn, transcription; 
UT, untreated. (E) SSB distribution around 56,546 CTCF binding sites in HCT116 cells. 
Average profiles from paired-end GLOE-seq fragments are plotted. (F) Footprint 
analysis of core and linker histones and SSBs around 56,546 CTCF binding sites 
in HCT116 cells. Linker histone H1.0 genomic occupancies are well positioned 
with a ~160-bp periodicity flanking the central CTCF binding site. GLOE-seq nick 
sites are piled up separately for the plus and minus strand. SSBs peak symmetrically 
and in a defined direction adjacent to the linker histone. (G) Three GLOE-seq 
replicates of each condition were filtered for duplicates and artificially 
overrepresented regions, downsampled to the same sequencing depth (2 million reads), 
and examined for pileups of three or more unique forward reads within a 20-bp 
window indicating exact or near-exact coincidence of SSBs in multiple individual 
cells, thus “hypersensitive” nick sites. Data are means ± SEM. *P = 0.011, 
****P = 5.6 x 10° (Student's t test and Bonferroni correction). (H) A comprehensive 
list of pileups was generated from pooled replicates. Each pileup region was 
matched against the pileup regions of the other five conditions to exclude common 
treatment-independent hypersensitive nick sites. WT cells showed more than 
1000 unique hypersensitive nick sites as compared to CAD KO cells. 

Fig. 3. ATM/ATR signaling to ICAD regulates CAD-induced DNA breaks 
after IR. (A) U2OS cells expressing mCherry-ICAD and GFP-CAD were 
microirradiated and imaged every minute for 25 min. Scale bar, 10 µm. Data 
are means ± SEM; cells per biological replicate, n = 6; biological replicates, 
N = 3. (B) Recruitment of CAD and ICAD to microirradiated laser stripes after 
inhibition of ATR. Conditions as in (A), except that ATRi AZ20 was added 
20 min before microirradiation. RFI, SEM, replicates, and n/N are the same as 
in (A). (C) ATR inhibition (AZ20) for the indicated times prior to collection 
affects chromatin recruitment of CAD at 24 hours after 8 Gy of IR in 
HCT116 cells. (D) DNA break measurements under conditions like those in 
(C) in HCT116 cells. Data are means ± SEM; N = 3, n > 100. *P < 0.0443 
(Kruskal-Wallis multiple-comparisons test). (E) Protein sequence analysis 
(Clustal Omega) of ICAD indicates two conserved ATM/ATR SQ phosphorylation 
motifs. (F) ICAD is phosphorylated at Ser107 and Ser257 after IR. 
CAD KO U2OS cells expressing GFP, WT ICAD, or DSA ICAD were irradiated 
(8 Gy) and GFP-tagged proteins enriched by GFP trap. Immunoblotting 
with phospho-specific antibodies to Ser107 and Ser257 on ICAD. (G) DSA-ICAD 
is not stably recruited to laser-microirradiated damage. U2OS cells 
expressing GFP-WT ICAD or GFP-DSA ICAD were laser-microirradiated and 
imaged every minute for 25 min. Data are means ± SEM; cells per biological 
replicate, n = 7; biological replicates, N = 3. (H) RPA foci in indicated cells 
24 hours after IR; N = 3, n > 50, box-and-whisker plot shows median and 
10th to 90th percentiles. *P = 0.0324 [multiple-comparisons analysis of 
variance (ANOVA)]. 

(TUNEL) (Fig. 2B). Activation of the DNA damage 
response was noted by increased RPA 
association, ATR (ataxia telangiectasia and 
RAD3-related) association, and phosphoryla- 
tion of histone variant H2AX (fig. S5, C to E). 
The DDR signaling from ICAD-tethered LacO 
arrays was dependent on CAD, as small inter- 
fering RNA (siRNA)-mediated knockdown of 
CAD reduced the recruitment of RPA and the 
phosphorylation of H2AX (Fig. 2C). Inhibition 
of caspase activity had no apparent effect on 
the recruitment of RPA to ICAD-tethered LacO 
arrays (fig. S5F). This demonstrates a capacity 
of CAD to mediate DNA break accumulation 
while associated with intact ICAD, independent 
of caspase signaling. 


CAD-dependent DNA breaks at defined 
genomic loci 


Together, our observations indicate that CAD- 
dependent DNA breaks appear to predomi- 
nantly manifest as SSBs. To determine whether 
these breaks were occurring at defined ge- 
nomic loci, we mapped SSBs at base-pair reso- 
lution in HCT116 wild-type or CAD KO cells 
before, 20 min after, and 24 hours after IR by 
GLOE-seq (genome-wide ligation of 3’-OH 
ends followed by sequencing, fig. S6A) (24). 
Examining natural SSB frequency as a func- 
tion of different chromatin states, we found a 
weak prevalence of SSBs at active enhancers 
and transcription initiation sites, known to be 
associated with accessible chromatin, as well as 
insulators as defined by CTCF binding (Fig. 2, D 
and E). CTCF has multiple functions in genome 
biology as it assists the three-dimensional (3D) 
folding of chromatin by regulating where 
chromatin loops form (25). This SSB 
distribution was also maintained immediately 
after irradiation and was independent of the 
presence of CAD enzyme (Fig. 2D). However, 
24 hours after irradiation, a more distinctive 
distribution was observed in wild-type 
but not CAD KO cells: SSBs concentrated 
more on insulator regions while becoming 
relatively depleted in heterochromatic 
regions (Fig. 2, D and E). This shift in pattern 
was entirely consistent among replicates (fig. 
S6B). Consistent with the chromatin state analy- 
sis, we also observed a high enrichment of SSBs 
at and around CTCF binding sites 24 hours after 
irradiation, in a CAD-dependent manner (Fig. 2E). 

We sought to more precisely pinpoint the 
SSBs around CTCF sites at base-pair resolution, 
using the mapping information of the first 
read, which identifies the exact nick ligation 
site and strand. Piling up nick sites around 
CTCF binding sites revealed a periodic pattern 
(Fig. 2F and fig. S7, A and B), with nicks in the 
plus and minus strand being separated by 
185 base pairs. Nucleosomes are known to 
be well positioned around CTCF sites (26); 
hence, we used published acetylated histone 
H3 and linker histone chromatin immunoprecipitation 
sequencing (ChIP-seq) data to 
delineate the position of core and linker his- 
tones (27, 28). Matching GLOE-seq with these 
ChIP-seq patterns revealed a consistent peri- 
odicity of core, linker histone, and SSB, with 
SSBs arising symmetrically left and right of 
the linker histone footprint (Fig. 2F). This 
suggested that DNA bound by nucleosomes 
and linker histones is protected from SSBs. 
The nick was introduced in a strand-specific 
manner, with the plus strand being nicked 
on the “plus” side of the linker histone, and the 
minus strand on the “minus” side of the linker 
histone; this suggests a potential role for his- 
tones (or chromatin structure in general) in 
directing and orienting CAD activity in a 
strand-specific manner. 
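
The strand-resolved pileup described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names (`strand_profiles`, `peak_offset`), the tuple-based inputs, the 500-bp flank, and the toy coordinates are all assumptions made for illustration. Each nick is expressed as an offset from nearby site centers, and offsets are tallied separately for the plus and minus strand so that the distance between the two strand-specific peaks can be read off.

```python
import bisect
from collections import Counter, defaultdict


def strand_profiles(nicks, sites, flank=500):
    """Tally nick offsets relative to site centers, separately per strand.

    nicks: iterable of (chrom, strand, position), strand being "+" or "-".
    sites: iterable of (chrom, center), e.g. binding-site midpoints.
    flank: hypothetical half-window (bp) around each center.
    """
    centers = defaultdict(list)
    for chrom, center in sites:
        centers[chrom].append(center)
    for chrom_centers in centers.values():
        chrom_centers.sort()

    profiles = {"+": Counter(), "-": Counter()}
    for chrom, strand, pos in nicks:
        chrom_centers = centers.get(chrom, [])
        # Binary search for every center lying within `flank` bp of the nick.
        lo = bisect.bisect_left(chrom_centers, pos - flank)
        hi = bisect.bisect_right(chrom_centers, pos + flank)
        for center in chrom_centers[lo:hi]:
            profiles[strand][pos - center] += 1
    return profiles


def peak_offset(profile):
    """Offset with the highest nick count (first one seen if counts tie)."""
    return max(profile, key=profile.get)


# Synthetic toy data, invented for this sketch only.
toy_sites = [("chr1", 1000), ("chr1", 5000)]
toy_nicks = [
    ("chr1", "+", 1060), ("chr1", "+", 5060), ("chr1", "+", 1010),
    ("chr1", "-", 960), ("chr1", "-", 4960), ("chr1", "-", 4990),
]
toy_profiles = strand_profiles(toy_nicks, toy_sites)
separation = peak_offset(toy_profiles["+"]) - peak_offset(toy_profiles["-"])
print("plus/minus peak separation (bp):", separation)
```

With real data, the toy lists would be replaced by first-read nick coordinates and CTCF site centers; the separation in the toy example is arbitrary and unrelated to the 185 base pairs reported above.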

CAD-induced SSBs appeared to be less ran- 
dom than naturally occurring SSBs. We noted 
initially that multiple unique GLOE-seq reads 
tended to accumulate at relatively few sites 
in the genome at 24 hours after irradiation, 
but not in any other condition. Thus, we used 
an unbiased approach to quantify SSB hot- 
spots (more than three unique SSBs mapped in 
close proximity on the same strand). Twenty- 
four hours after irradiation in wild-type but 
not CAD KO cells, we observed a statistically 
significant (P = 5.6 x 10°°) factor of 2 in- 
crease across the three experimental replicates 
(Fig. 2G). A large number of the hypersen- 
sitivity sites that arose 24 hours after IR were 
new. We termed these CAD-dependent SSBs 
(CdSSBs), and they appeared to accumulate 
at hotspots different from those generally 
sensitive to occurrence of SSBs. Comparing 
wild-type and CAD KO cells 24 hours after 
irradiation, we observed 1371 unique pileups 
in wild-type cells, corresponding to putative 
CdSSBs, whereas ~581 unique pileups were 
present in the CAD KO but not wild-type cells 
(Fig. 2H). Notably, 232 of these 1371 CdSSBs 
overlapped with published CTCF peaks and 
195 with DNase hypersensitive sites. Of the 
unique 581 pileups found in CAD KO cells, 
the overlap was only 11 and 14, respectively. In 
summary, genome-wide SSB mapping revealed 
a characteristic, unusual, CAD-dependent SSB 
landscape 24 hours after irradiation. 
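
The hotspot definition used above can be sketched in a few lines of Python. This is a hedged illustration rather than the authors' code: it follows the Fig. 2G wording (three or more unique reads on the same strand within a 20-bp window), assumes the reads have already been deduplicated and downsampled, and uses hypothetical names (`call_pileups`, `unique_pileups`) and toy coordinates. Overlapping windows that qualify are merged into one pileup region, which is a design choice of this sketch.

```python
from collections import defaultdict


def call_pileups(nicks, min_reads=3, window=20):
    """Return pileups as (chrom, strand, start, end, n_reads).

    nicks: iterable of (chrom, strand, position), one entry per unique read.
    A pileup is a run of reads in which at least `min_reads` nick positions
    fall within `window` bp of each other on the same strand.
    """
    by_key = defaultdict(list)
    for chrom, strand, pos in nicks:
        by_key[(chrom, strand)].append(pos)

    pileups = []
    for (chrom, strand), positions in sorted(by_key.items()):
        positions.sort()
        current = None  # [first_index, last_index] of the open region
        right = 0
        for left in range(len(positions)):
            right = max(right, left)
            # Extend the window while the next nick is < `window` bp away.
            while (right + 1 < len(positions)
                   and positions[right + 1] - positions[left] < window):
                right += 1
            if right - left + 1 >= min_reads:
                if current is not None and left <= current[1]:
                    current[1] = max(current[1], right)  # merge overlap
                else:
                    if current is not None:
                        first, last = current
                        pileups.append((chrom, strand, positions[first],
                                        positions[last], last - first + 1))
                    current = [left, right]
        if current is not None:
            first, last = current
            pileups.append((chrom, strand, positions[first],
                            positions[last], last - first + 1))
    return pileups


def unique_pileups(target, others):
    """Drop pileups in `target` that overlap any pileup in `others`."""
    def overlaps(a, b):
        return (a[0] == b[0] and a[1] == b[1]
                and a[2] <= b[3] and b[2] <= a[3])
    return [p for p in target if not any(overlaps(p, q) for q in others)]


# Synthetic toy reads, invented for this sketch only.
toy_wt = [("chr1", "+", 100), ("chr1", "+", 105), ("chr1", "+", 119),
          ("chr1", "+", 500), ("chr1", "+", 510),
          ("chr1", "-", 100), ("chr1", "-", 101), ("chr1", "-", 102)]
toy_other = [("chr1", "-", 95, 101, 4)]  # pileup shared with another condition
wt_pileups = call_pileups(toy_wt)
print(unique_pileups(wt_pileups, toy_other))
```

Applied to the counts reported above, the overlap fractions are 232/1371 ≈ 16.9% (CTCF peaks) and 195/1371 ≈ 14.2% (DNase hypersensitive sites) for the wild-type pileups, versus 11/581 ≈ 1.9% and 14/581 ≈ 2.4% for the CAD KO pileups.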


DDR signaling through ICAD coordinates CAD 
activity after genotoxic stress 


Next, we investigated whether CAD and ICAD 
are embedded in the DDR signaling machin- 
ery in response to genotoxic stress. To explore 
this, we examined the recruitment of CAD/ 
ICAD to stripes of microirradiated DNA in 
real time. Here, we observed that both ICAD 
and CAD were recruited with comparable ki- 
netics to subnuclear regions of DNA breaks 
(Fig. 3A and movies S1 and S2). Unexpectedly, 
we noted that ICAD could also be recruited to 
chromatin after IR in the absence of CAD, 
which could indicate that the recruitment of 
CAD is mediated in part by ICAD after IR 
(fig. S8A). Examination of the recruitment to 
microirradiated DNA of a series of ICAD trun- 
cation fragments in ICAD-deficient cells in- 
dicated that all fragments could be recruited 
to irradiated regions. Fragments lacking 
the C-terminal domains of ICAD displayed 
a modest delay in accumulation (fig. S8, B 
to D). These results indicate that multiple 
regions of ICAD are responsible for chromatin 
recruitment. 

Next, we examined whether a DDR signal 
could regulate the recruitment of CAD/ICAD 
to microirradiated stripes. We noted that in- 
hibition or loss of ATM (ataxia telangiectasia 
mutated) and ATR kinase activity could limit 
the recruitment of CAD/ICAD (Fig. 3B and fig. 
S9, A to D). A similar role of the ATR kinase 
was also observed the day after IR, where ATR 
inhibition transiently diminished CAD chroma- 
tin recruitment as well as the corresponding 
number of DNA breaks (Fig. 3, C and D). To 
determine whether ATR regulates ICAD more 
directly, we analyzed the sequence of ICAD and 
identified two potential ATM/ATR phosphorylation 
sites, Ser107 and Ser257, that are highly 
conserved across mammalian species (Fig. 3E). 
Further, the phosphorylation of Ser””” on ICAD 
was previously identified in phosphoproteome 
analysis of human cells exposed to IR (29). We 
assessed the propensity of ATR to phosphory- 
late ICAD in vitro, and demonstrated that ICAD 
can be directly phosphorylated by ATR, depen- 
dent on the ICAD SQ sites (fig. S9E). To further 
address the biological relevance of ICAD phos- 
phorylation, we generated phospho-specific 
antibodies toward the Ser107 and Ser257 sites. 
Both sites appeared extensively phosphorylated 
after cell exposure to IR, and the phosphory- 
lation was dependent on both ATM and ATR 
kinase activity 24 hours after IR (Fig. 3F and 
fig. S9, F to I). Next, we expressed a serine-to- 
alanine mutant form of ICAD that could not 
be phosphorylated (S107A and S257A; DSA) 
to investigate the functional relevance of these 
phosphorylation events (Fig. 3F and fig. S9F). 
Measuring recruitment to microirradiated 
stripes of DNA damage demonstrated that 
the DSA variant could not be stably recruited 
to these sites, unlike the wild-type ICAD (Fig. 
3G). In addition, the DSA variant could not 
completely restore RPA foci formation at 
24 hours after IR even though expression of 
CAD was restored (Fig. 3H and fig. S9F). This 
indicates that the ATR/ATM-dependent phos- 
phorylation of ICAD functionally contributes 
to regulating the induction of DNA breaks and 
continued control of the checkpoint through 
CAD/ICAD after IR. 


CAD is required for cell cycle checkpoint 
control and tumor cell survival after IR 


Next, we investigated the role of CAD/ICAD-dependent 
maintenance of the G2 cell cycle 
checkpoint. We confirmed premature mitotic 
entry observed in the siRNA-based screen using 
individual siRNAs targeting CAD and ICAD 
and in the CAD-deficient KO cells (Fig. 4, A 
to C). This revealed that the breakdown in G2 
checkpoint control in CAD-deficient cells was 
most pronounced 24 hours after IR, which 
corresponded with our observed peak of CAD- 
inflicted DNA breaks (Fig. 4, B and C). To 
further characterize the molecular basis of 
the CAD/ICAD-dependent checkpoint regulation, 
we examined checkpoint signaling and noted a 
reduction of the inhibitory phosphorylation of 
cyclin-dependent kinase CDK1 (Tyr15), a 
reduction in active checkpoint 
kinase CHK2, and a moderate difference in 


Fig. 4. CAD/ICAD are required for tumor cell 
survival and checkpoint maintenance after 
IR. (A) Mitotic (pS10-H3) staining of irradiated 
and nocodazole (NZ)-trapped U2OS cells 
transfected with control siRNA (UNC) and siRNA 
against CAD. Cells were irradiated with 6 Gy 
of IR; after a 2-hour recovery, NZ was added for 
8 hours before cells were fixed for 
immunofluorescence. Scale bar, 50 µm. (B) G2 
checkpoint maintenance in HCT116 CAD siRNA 
knockdown cells upon 8 Gy of IR. Data are 
means ± SEM; N = 3. *P < 0.05 (unpaired 
Student's t test). (C) G2 checkpoint maintenance 
in HCT116 CAD KO cells 24 hours after 
8 Gy of IR. Data are means ± SEM; N = 3. 
*P < 0.05 (unpaired Student's t test). (D) G2 
checkpoint maintenance in U2OS ICAD KO cells 
24 hours after 6 Gy of IR. Data are means ± 
SEM; N = 3. *P < 0.05 (unpaired Student's 
t test). (E) Relative colony outgrowth of 
CAD-depleted cells after IR exposures. 
Data are means ± SEM; N = 4. *P < 0.05 
(unpaired Student's t test). (F) Relative colony 
outgrowth of CAD KO HCT116 cells upon 
indicated IR exposure. Data are means ± SEM; 
N = 4. *P < 0.05 (unpaired Student's t test). 
(G) Genomic instability in U2OS WT and ICAD 
KO cells upon 8 Gy of IR. % Genomic instability 
represents total number of cells displaying 
micronuclei and fragmented nuclei divided by 
the total number of cells. Data are means ± 
SEM; N = 3, n > 400. **P = 0.0016 (ANOVA). 
(H) Immunoblotting of pY701 STAT1 3 days after 
6 Gy of IR in WT and ICAD KO cells. CDK1 
inhibitor RO-3306 was added 2 hours after IR. 
(I) Normalized tumor growth of HCT116 WT and 
HCT116 CAD KO tumors after 4 Gy of IR. Data 
are means ± SEM; n = 6. **P = 0.0055 (two-way 
multiple-comparisons ANOVA). (J) Model of 
CAD-dependent G2 phase checkpoint. 

active CHK1 (fig. S10, A to E). CDK1 inhibition 
is required to restrict mitotic entry 
after IR. Additionally, activated CHK2 and the 
phosphorylation of KAP1 have also been im- 
plicated in controlling mitotic entry of cells 
after IR (30). In line with our previous obser- 
vations, this checkpoint function appeared 
independent from caspase signaling and un- 
related to apoptotic cell death (fig. S11, A to 
C). Further, a CAD-promoted G2 checkpoint 
was not observed in nonmalignant cells (fig. 
S11D), as normal cells harbor proficient p53 
and pRB pathways that ensure robust G1/S 
transition control and diminished G2 checkpoint 
dependency. The timing of G2 checkpoint 
breakdown in CAD-deficient cancer 
cells corresponded to the kinetics of CAD- 
inflicted DNA breaks, indicating that this 
DNA modification may functionally prevent 
premature mitotic entry after IR. Notably, 
CAD/ICAD-deficient cells that entered mito- 
sis prematurely exhibited a high number of 
lagging chromosomes and chromatin bridges 
(fig. S11E). 

Aberrant cell cycle progression and premature 
mitotic entry with unrepaired DSBs 
lead to mitotic cell death, which contributes 
to radiosensitivity (21, 31). Indeed, cancer cells 
lacking the expression of CAD or ICAD dis- 
played increased radiosensitivity (Fig. 4, E 
and F, and fig. S12A). This phenomenon was 
selective for cancer cells, as loss of CAD in 
nonmalignant cells had no impact (fig. S12B). 
Failure to repair DNA breaks through the in- 
hibition of PARP activity has been suggested 
to sensitize cells to IR (32). Therefore, we 
examined whether failure to repair CAD- 
inflicted DNA breaks would contribute to this 
sensitization. We found that CAD-proficient 
cancer cells were sensitized to radiation by the 
addition of PARP inhibitor 24 hours after IR. 
However, PARP-inhibited CAD-deficient cells 
did not display any additional sensitization to 
IR (fig. S12C). Additionally, we noted that the 
loss of G2 cell cycle checkpoint control led to 
the increased incidence of unstable nuclei 
(micronuclei and fragmented nuclei) (Fig. 4G 
and fig. S11E). Such genomic instability is 
considered to be a potent molecular pattern signal 
that activates inflammatory STAT1 signaling 
(33). Thus, we assessed the activating 
phosphorylation of STAT1 at Tyr701 after IR in 
wild-type and CAD-deficient cells. This revealed 
elevated p-Tyr701 STAT1 in CAD-deficient cells 
after IR, which was dependent on progression 
through mitosis (Fig. 4H). These observations 
are in line with recent research demonstrating 
that premature progression through mitosis 
after radiation promotes STAT1 signaling, and 
our data further indicate that CAD/ICAD limits 
this response (33, 34). 

To complement the cell-based observa- 
tions, we analyzed CAD function in a model of 
tumor radiotherapy in vivo, using human 
tumor xenografts and measuring tumor growth after 
radiation (Fig. 4I and fig. S12, D to F). 
Consistent with the cell-based survival assessment, a 
pronounced negative impact on tumor growth 
was detected after irradiation in tumors de- 
ficient in CAD, relative to their CAD-proficient 
counterparts. The irradiated CAD-deficient 
tumors demonstrated elevated p-Tyr701 STAT1 
compared to the wild-type tumors at endpoint 
(fig. S12G). Collectively, these results support 
the concept that the CAD-dependent pathway 
actively promotes cancer cell survival after IR. 
Previous murine studies had implicated CAD 
as a potential tumor suppressor, which was 
linked to pro-apoptotic function (10). How- 
ever, analysis of gene expression data com- 
paring normal and malignant tissues in human 
cancers indicated that loss of function of CAD/ 
ICAD is a rare event (fig. S13, A and B) (35). 
Further, elevated expression of CAD in particular 
was noted in multiple tumor types (fig. 
S13, A and B), an observation that is 
consistent with a potential, as yet unidentified, 
tumor-supporting role for CAD. 


Discussion 


Together, our results reveal a DDR-mediated 
G2 phase checkpoint pathway in which cancer 
cells exposed to IR inflict reversible CAD- 
dependent DNA breaks including the CdSSBs. 
These lesions stimulate signaling responses 
and prevent premature mitotic entry (Fig. 4J), 
which enhances cancer cell survival. As repair 
of IR-induced DNA damage progresses, the 
number of highly genotoxic complex DNA 
DSBs declines, dropping below a threshold 
required to maintain the checkpoint. In turn, 
the induction of CAD-dependent DNA breaks 
signals an amplification of the DDR, further 
stabilizing the G2 checkpoint and thereby pro- 
viding more time for repair of the more com- 
plex, difficult-to-repair, and potentially lethal 
IR-induced genotoxic lesions. 

The observations presented here indicate 
that the CAD-mediated checkpoint signal is 
primarily dependent on the generation of 
CdSSBs, a form of DNA damage characterized 
by rapid repair kinetics. Further, we implicate 
the activity of PARP-1 in the repair of CAD- 
mediated DNA breaks, as the addition of a 
nicotinamide adenine dinucleotide (NAD)- 
like PARP-1 inhibitor, 4-ANI, promotes DNA 
break formation in a CAD-dependent manner. 
Inhibition of PARP-1 activity has been reported 
to impair the G1 checkpoint but enhance the G2 
checkpoint in irradiated cells (36), which may 
be the result of loss of PARP-1-directed repair 
of CdSSBs leading to increased DNA lesion 
burden. Furthermore, we uncovered an acti- 
vating role of ICAD phosphorylation mediated 
by the DNA damage-induced ATM and ATR 
kinases. This may suggest an ongoing DNA 
damage-mediated feedback loop that is active 
until repair of complex lesions is completed, at 
which stage the kinase signaling declines. In 
this regard, we note that multiple posttranslational 
modification sites have been identified 
in ICAD (www.phosphosite.org/proteinAction.action?id=9541&showAllSites=true). These 
additional sites suggest that multiple signaling 
events may converge to regulate the CAD/ICAD 
checkpoint pathway. 

Prior observations had established a tempo- 
rally delayed secondary wave of SSBs after IR; 
however, the origin and functional role of these 
lesions have remained obscure. We identified 
CdSSBs as the source of these lesions, which 
accumulate at a subset of CTCF sites in the ge- 
nome. These genomic loci may be accompanied 
by a moderate number of CAD-dependent 
stochastic lesions that escape detection because 
of rare targeting events within unspecified 
regions. Functionally, CTCF sites serve as 
binding domains for the CTCF protein, which 
regulates 3D chromosomal looping and topo- 
logically associated domains (TADs) in inter- 
phase cells (25). In response to positioned 
DSBs, chromosomal loops form in proximity 
to the CTCF sites, which act to sculpt the 
chromatin spreading of the phosphorylated 
histone variant yH2AX. Given that CdSSBs 
extend cell cycle checkpoint control, an appeal- 
ing concept is that the newly formed SSBs help 
to enforce a chromatin response to DNA dam- 
age. This is supported by the CAD-dependent 
phosphorylation of KAP1, a major chromatin 
marker of ongoing DDR. Here, CAD nuclease 
activity is restricted to generate CTCF-directed 
SSBs after IR, which is in contrast to CAD- 
dependent genome-wide intranucleosomal 
cleavage events during apoptosis. Hence, the 
precision of strand break formation may be a 
determinant in how CAD guides cancer cell 
survival. 

On the basis of our present study, we pro- 
pose that CAD/ICAD signaling is an adaptive 
cancer-intrinsic mechanism to resist genotoxic 
stress. The apparent selectivity of this stress- 
tolerance pathway likely reflects multiple fac- 
tors that occur in cancer but not normal cells, 
including defects in the p53 and pRB pathways 
controlling G1/S transition, oncogene-driven 
premature S-phase entry, enhanced 
replication stress, and defective DNA repair, 
as well as checkpoint signaling mechanisms. 
The CAD/ICAD-mediated pathway may reflect 
adaptation to the genome-destabilizing selec- 
tive pressures during tumorigenesis and con- 
tribute to therapy resistance. Such a prosurvival 
checkpoint pathway also reveals a cancer- 
selective vulnerability, thereby providing a 
potential avenue to enhance tumor cell 
radiosensitivity by targeting this G2 cell cycle 
checkpoint. 


Molecular and neural basis of pleasant touch sensation 


Benlong Liu, Lina Qiao, Kun Liu, Juan Liu, Tyler J. Piccinni-Ash, Zhou-Feng Chen 


Pleasant touch provides emotional and psychological support that helps mitigate social isolation and 
stress. However, the underlying mechanisms remain poorly understood. Using a pleasant 
touch-conditioned place preference (PT-CPP) test, we show that genetic ablation of spinal excitatory 
interneurons expressing prokineticin receptor 2 (PROKR2), or its ligand PROK2 in sensory neurons, abolishes 
PT-CPP without impairing pain and itch behaviors in mice. Mutant mice display profound impairments in 
stress response and prosocial behaviors. Moreover, PROKR2 neurons respond most vigorously to gentle 
stroking and encode reward value. Collectively, we identify PROK2 as a long-sought neuropeptide that 
encodes and transmits pleasant touch to spinal PROKR2 neurons. These findings may have important 
implications for elucidating mechanisms by which pleasant touch deprivation contributes to social avoidance 
behavior and mental disorders. 


Our sense of touch is composed of discriminative 
and affective components. 
Discriminative touch detects physical 
properties of tactile stimuli (e.g., location, 
shape, texture, force, etc.), whereas 
affective touch conveys emotional value that 
is modulated by social context (1, 2). Pleasant 
touch (e.g., cuddling, caressing, and hugging) 
encodes positive hedonic information that facilitates 
emotional development, affiliative behavior, 
and the well-being of social animals 
(1, 3, 4). Social touch is one of the most favored 
activities that might be evolutionarily con- 
served throughout the animal kingdom (5, 6). 
In nonhuman primates, rodents, birds, and 
insects, allogrooming behavior (or allopreen- 
ing for birds) is important for strengthening 
and maintaining social bonding, reciprocity, 
attachment, and hierarchy (7-9). Acute social 
isolation increases social cravings and reward-seeking 
behavior (10). Harlow’s pioneering 
work demonstrated that infant rhesus monkeys 
separated from their mothers have an innate 
desire to cuddle soft cloth for contact comfort 
and emotional needs, and maternal touch is 
vital for the behavioral and psychological development 
of offspring (11). Likewise, long-term 
deprivation of maternal care and positive 
social touch has lasting negative consequences 
on the mental health of children (12, 13). In 
fact, affective touch avoidance and deficiency 
are some of the hallmarks of several neuro- 
psychiatric disorders, including autism spec- 
trum disorders (ASDs) (14, 15). Studies in 
humans have shown that C tactile (CT) fibers 
innervating hairy skin encode positive valence 
of social touch (1, 16-19), whereas MrgprB4-expressing 
sensory neurons and Gpr83-expressing 
spinal projection neurons have been implicated 
in mice (20, 21). Despite its profound 
importance, how pleasant touch information 
is encoded and transmitted from somato- 
sensory neurons to the spinal cord remains 
unknown. Our understanding of the mole- 
cules and neural circuits of pleasant touch has 
been hampered by a paucity of suitable animal 
models and methodologies that permit accu- 
rate inference and assessment of the affective 
state of mice that experience pleasant touch. 
Unlike discriminative touch, affective touch 
mediated by unmyelinated C fibers is a slow 
process (7). We postulated that pleasant touch 

Fig. 1. PROKR2 neurons are a unique population of spinal excitatory interneurons.
(A to E) Double staining of GFP with PKCγ at three levels of the spinal cord (A)
and GFP with various markers in the lumbar cord [(B) and (C)]. Arrows indicate
double-labeled cells, and arrowheads indicate GFP only. n = 3 mice. Scale bars,
100 μm [(A) and (C)] and 20 μm (B). (D) Quantification of (C), percentage of
Prokr2^GFP cells in lamina II (green) and lamina I (blue). (E) Quantification of
(B) and (C). Red in (E) indicates the percentage of double-labeled cells of
Prokr2^GFP cells, and blue indicates GFP only. (F) Schematic of intraspinal
injection of virus AAV5-Ef1a-DIO-EGFP-2a-TK-WPRE-pA (TK-GFP) and
HSV-dTK-LSL-tdTomato (HSV-dTK-Tdtomato) in the dorsal horn of Prokr2^Cre mice
at the lumbar level (left). (Right) Image showing virus expression in a starter
neuron (yellow) expressing GFP and Tdtomato. Scale bar, 20 μm. (G) Double
staining of Gpr83 or NK1R anterogradely labeled with Tdtomato. Scale bar, 20 μm.
(H) Quantification of (G). n = 3. (I to K) Schematic of the whole-cell patch-clamp
recording of Prokr2^GFP neurons in the spinal cord slice preparations (I), a
representative trace of the initial firing pattern (red) and single-spike firing
pattern (blue) at 20 pA [rheobase (Rb)] and 40 pA [twofold rheobase (2 × Rb)] (J),
and proportions of different types of firing pattern (K). n = 66 neurons.
(L to N) Schematic of the recording of the type of inputs onto Prokr2^GFP

is encoded by slow-acting neuropeptides in C
fibers and their cognate excitatory G protein- 
coupled receptors (GPCRs) in laminae II of 
the spinal cord that consist of microcircuits 
for relaying discrete sensory modalities from 
primary afferents to the brain (22). In a search 
for lamina- and modality-specific GPCRs, we 
found that prokineticin receptor 2 (PROKR2) 
is uniquely expressed in laminae II of the 
spinal cord, and PROKR2 neurons represent 
a previously unknown population of spinal 
excitatory interneurons. Using an unbiased 
behavioral paradigm in combination with 
physiological tests, extracellular recording, 
and genetic approaches, we sought to examine 
the role of the PROK2-PROKR2 signaling in
pleasant touch. 


Properties of spinal PROKR2-expressing 
lamina II excitatory interneurons 


We used Prokr2^GFP transgenic mice as a sur-
rogate to characterize PROKR2 expression in
the spinal cord. RNAscope in situ hybridiza-
tion (ISH) followed by immunohistochemical
(IHC) studies indicated that green fluorescent
protein (GFP) of Prokr2^GFP mice recapitulates
a large part of Prokr2 expression in laminae II
(80.4%) of the spinal cord (Fig. 1, A to E). A
significant fraction of GFP is distributed in the
dorsal side of the lamina II inner (IIi) layer
neurons with dorsal root stimulation (L), representative traces of different types of inputs (M), and their proportions (N). eEPSCs, evoked excitatory
postsynaptic currents. n = 41 neurons. All data are presented as means ± SEMs.


innervated by isolectin B4 (IB4)-binding non- 
peptidergic fibers (Fig. 1C). Prokr2 is largely 
colocalized with the excitatory genes, like ve- 
sicular glutamate transporter 2 (Vglut2) (94.1%) 
and Lmx1b (96.8%), but rarely overlaps with
the inhibitory neuronal markers, such as vesi-
cular γ-aminobutyric acid (GABA) transporter
(Vgat) (6.4%) or Pax2 (3.1%) (Fig. 1, B and
E, and fig. S1, A and B). RNAscope ISH and
IHC showed that Prokr2 rarely overlaps with
gastrin-releasing peptide receptor (Grpr) (7.8%),
an itch-specific receptor expressed in laminae I
and II interneurons (23, 24), or protein kinase C
γ (PKCγ) (5.7%), which labels the ventral side
of the lamina IIi layer, or with Gpr83 (7.0%) or
Fig. 2. PROKR2 neurons are dispensable for acute pain and itch behaviors.
(A) Schematic of intraspinal injection of Cre-dependent adeno-associated
virus (AAV) expressing YFP. (B and C) Double staining of YFP and Prokr2
(arrow indicates a double-stained cell) in the lumbar cord (B) and percentage
of overlapping cells (C). Scale bars, 50 μm (left) and 5 μm (right). (D) Strategy
for intersectional genetic ablation of spinal Prokr2^Cre neurons. Prokr2^Cre mice
were mated with Lbx1^Flpo, Tau^ds-DTR, and Prokr2^GFP lines to generate diphtheria
toxin receptor (DTR)-expressing Prokr2 neurons, which were ablated by injection of
diphtheria toxin (50 μg/kg, i.p.). (E and F) Images of Prokr2^GFP neurons in WT and ABL
mice (E) and quantification of Prokr2^GFP neurons (F). Scale bar, 150 μm. (G to
L) Comparable latencies in hot plate (G), Hargreaves (H), and cold plate (I) tests;
withdrawal threshold in von Frey test (J); licking or flinching time induced by capsaicin
(2 μg, i.pl.) (K); and scratching numbers induced by chloroquine (200 μg, i.d.) (L)
between WT and ABL mice. i.pl., intraplantar injection; i.p., intraperitoneal injection; i.d.,
intradermal injection. n = 3 (F); n = 8 to 9 [(G) to (L)]. Statistics by unpaired t test
in (F) and (G) to (L). ***P < 0.001; n.s., not significant. Error bars indicate SEMs.


NK1R—two markers expressed in distinct sub- 
sets of projection neurons—implicated in af- 
fective touch and pain, respectively (20) (Fig. 1, 
B, C, and E). To identify the downstream tar- 
get of PROKR2 neurons, we next performed 
Cre-dependent virus-mediated anterograde 
monosynaptic tracing in the spinal cord of 
Prokr2^Cre mice using HSV-dTK-LSL-tdTomato
as the monosynaptic tracer to follow the out-
put neurons (Fig. 1F). RNAscope ISH or IHC
revealed that ~85.5% of the output neurons
labeled by tdTomato express Gpr83, whereas
only 8.5% express NK1R (Fig. 1, G and H).
We next examined whole-cell patch-clamp
recordings of spinal cord slices obtained from
Prokr2^GFP mice. The firing pattern of most
Prokr2^GFP neurons was very homogeneous:
~92.4% displayed initial bursting firing, with
a few neurons showing single-spike firing
(Fig. 1, I to K). To identify the type of peri-
pheral sensory inputs, we recorded the re-
sponse of Prokr2^GFP neurons in the parasagittal
section of spinal slices obtained from Prokr2^GFP
mice with the dorsal root (L4 to L5) attached,
using the dorsal root stimulation method as
described previously (Fig. 1L) (25). Prokr2^GFP
neurons predominantly receive monosynaptic
and polysynaptic C fiber inputs (a combined
85.4%) with a small fraction of polysynaptic
Aδ (4.9%) or Aβ inputs (9.8%) (Fig. 1, M and
N). Furthermore, PROK2 application evoked
subthreshold depolarizations in most Prokr2^GFP
neurons, which were blocked by PKRA7, a
PROKR2 antagonist (fig. S1, C and D).


PROKR2 neurons are dispensable for pain and 
itch transmission 


To assess the function of spinal PROKR2 
neurons, we first validated Prokr2^Cre mice
by intraspinal injection of Cre-dependent yel-
low fluorescent protein (YFP) virus. Most
Prokr2^Cre neurons expressed Prokr2 (81.2%)
(Fig. 2, A to C). We then used an intersectional
genetic strategy to ablate spinal PROKR2 neu-
rons in Prokr2^Cre mice, hereafter referred to as
ABL mice, as previously described (25) (Fig. 2,
D to F, and fig. S2, A and B). The specificity of
ablation was demonstrated by the loss of most
Prokr2^GFP neurons (84.9%) and Prokr2 (81.3%)
without affecting PKCγ, neurokinin B (NKB),
and Grpr in the spinal cord or Prokr2^GFP in
discrete brain regions (Fig. 2, D to F; fig. S2,
C and D; and fig. S3). A battery of pain and
itch tests was conducted to assess the role of
Prokr2^Cre neurons in somatosensory transmis-
sion. We did not observe a statistically sig-
nificant difference in thermal pain, cold pain,
mechanical thresholds, or inflammatory pain 
induced by capsaicin between ABL mice and 
their littermate control mice [referred to as 
wild-type (WT) mice; Fig. 2, G to K]. Chemi- 
cal and mechanical itch as well as the hairy 
skin sensitivity to a sticky tape of ABL mice 
were comparable to those of WT mice (Fig. 
2L and fig. S2, E and F). 


PROKR2 neurons convey pleasant 
touch sensation 


As a weak stimulus, gentle stroking with a soft 
brush on the hairy skin of mice does not elicit 
a robust motor response that can be used as 
a proxy to quantify pleasant touch sensation. 
Moreover, using a hand or soft brush to sim- 
ulate pleasant touch in mice often causes 
avoidance behavior without habituation and 
conditioning. To circumvent these confounds, 
we developed a protocol that includes two 
procedural features to avoid the stress and 
anxiety associated with handling. First, given
that social isolation increases the urge for
social touch and attachment (26), mice were
single-housed for 1 week followed by week-
long daily stroking sessions in the home cage.
This procedure conditioned mice into a quies-
cent or inactive state upon gentle stroking,
akin to pets being groomed (Fig. 3, A and B,
and movies S1 to S3). Second, to increase the
motivational drive to obtain rewards associ-
ated with pleasant touch (27), we performed
eight conditioning sessions (one session per
Fig. 3. PROKR2 neurons transmit pleasant touch sensation. (A) A schematic
diagram of the experimental procedure for PT-CPP. (B) Photo of a mouse being
stroked with a soft brush. (C and D) Representative trajectory plots (C) and time
spent (D) in stroking-paired chamber for WT and ABL mice in the preconditioning
baseline (Pre-c BL) and preference test (Pref T). Pre-c BL versus Pref T: P < 0.01 for
WT and P = 0.6822 for ABL mice. (E and F) Heart rate (E) and thermal pain
threshold (F) in the baseline and poststroking test. Heart rate, BL versus
poststroking: P < 0.01 for WT and P = 0.1563 for ABL mice; hot plate, BL versus
poststroking: P < 0.001 for WT and P = 0.5206 for ABL mice. bpm, beats per
minute. (G) Schematic of intraspinal injection of Cre-dependent AAV virus expressing
ChR2-eYFP or eYFP (top left) and image showing ChR2-eYFP in the cervical cord
(top right). Schematic of real-time place preference test (RTPP) (bottom). Scale bar,
50 μm. (H and I) Representative heatmap (5 Hz) (H) and the percentage of time
spent (I) in the chamber paired with the laser in the RTPP test. n = 9 [(D) to (F)];
n = 6 (I). Statistics by two-way repeated measures analysis of variance (ANOVA)
followed by Bonferroni's multiple comparisons test [(D), (E), and (F)] or by two-
way ANOVA followed by Bonferroni's multiple comparisons test (I). **P < 0.01;
***P < 0.001; n.s., not significant. Error bars indicate SEMs.
2 days) using an unbiased two-chamber pleas-
ant touch-conditioned place preference (PT-
CPP) apparatus in which WT mice displayed 
no preference for either chamber (Fig. 3A, fig. 
S4B, and movie S4). On the test day, we per- 
formed a PT-CPP test to evaluate whether mice 
would spend more time in the chamber paired 
with gentle stroking than without. Both male 
and female mice developed PT-CPP as they 
spent significantly more time in the chamber 
paired with a soft brush (fig. S4, A, C, and D). 
These results demonstrate that gentle stroking 
in mice encodes the positive valence or hedonic 
value. In marked contrast to WT mice, ABL 
mice completely failed to show PT-CPP (Fig. 3, 
C and D), which indicates a profound loss of 
pleasant touch sensation. If PROKR2 neurons 


Fig. 4. PROKR2 neurons display characteristic features in response to gentle
stroking. (A) (Left) Dorsal view of an anesthetized Prokr2^Cre;Ai32 mouse showing
the position of optical fiber and electrode implanted in the lumbar spinal cord.
(Right) Cross-sectional view of the lumbar spinal cord showing optogenetic tagging
of Prokr2^ChR2 neurons in lamina II with blue lights on. (B) Example response of a
PROKR2-expressing neuron to blue light activation. (Left) Spike raster showing
multiple trials of laser stimulation at 1 Hz. (Right) The firing rate of one
opto-tagged neuron within 10-ms light pulses. (Inset) Waveform on expanded

skin. Dots represent spike rate of

convey pleasant touch, their direct activation— 
in the absence of either primary afferent input 
or behavioral conditioning or context—should 
be positively reinforcing. We injected Cre- 
dependent channelrhodopsin-2 (ChR2) or en- 
hanced YFP (eYFP) virus intraspinally into 
Prokr2^Cre mice followed by the real-time place
preference test (RTPP) using optogenetics
(Fig. 3G). Consistently, Prokr2^ChR2 mice pre-
ferred the chamber paired with photostimu-
lation of PROKR2 neurons (5 and 10 Hz),
whereas Prokr2^eYFP mice showed no pref-
erence (Fig. 3, H and I). Together, these results
demonstrate that PROKR2 neurons encode
positive valence and rewarding value.
Apart from pleasant sensation (1), gentle

a single trial from individual neurons (E). (F to H) Schematic of brush stroking
in different directions (F) (L-R: from left to right; R-C: from rostral to caudal),
representative traces (top) and corresponding PSTHs (1-s bin) (bottom) (G), and
firing rate (H) of spinal Prokr2^ChR2 neurons in response to stroking in different
directions. Brush stroking was applied across the receptive field of the hindlimb
in different directions at a speed of 18 to 22 cm/s. (I and J) Representative trace
(top) and corresponding PSTH (1-s bin) (bottom) (I) and firing rate (J) of spinal
Prokr2^ChR2

stroking on the hairy skin of humans de-
creases heart rate (28) and pain sensation (29),
reflecting a soothing state with a reduced level 
of stress (e.g., increasing the threshold for 
thermal pain). To ascertain whether mice 
might develop similar physiological changes, 
we measured the heart rate, thermal pain, and 
stress hormones after gentle stroking. Mice 
showed significantly reduced heart rates, in- 
creased thermal pain thresholds, and reduced 
plasma corticosterone levels (fig. S4, E to G). 
Consistent with behavioral studies, ABL mice 
displayed no significant reduction of the heart 
rate or analgesic effect in response to gentle 
stroking (Fig. 3, E and F). The lack of behav- 
ioral and physiological changes in ABL mice 
further demonstrates the crucial role of PROKR2 
neurons in conveying pleasant touch. 
neurons in response to 10 repeated brush stroking stimuli with intervals of 2 s.
(Inset) Superimposed waveforms on expanded time scale representing spikes of
Prokr2^ChR2 neurons evoked by stroking in (D), (G), and (I). n = 8 to 10 neurons from
three to four mice. Statistics by one-way repeated measures ANOVA followed by
Bonferroni post hoc [(E) and (J)] or by paired t test (H). *P < 0.05; **P < 0.01;
***P < 0.001; n.s., not significant. Error bars indicate SEMs.


29 APRIL 2022 » VOL 376 ISSUE 6592 487 


RESEARCH | RESEARCH ARTICLES 


Neurophysiological features of PROKR2 neurons 
in response to varying stimuli 

To interrogate neural correlates of pleasant
touch, we performed in vivo extracellular re-
cording of the response of spinal PROKR2 neu-
rons to gentle stroking using a soft brush moving
across the receptive field of the hindlimb of
Prokr2^Cre;Ai32 mice expressing ChR2-eYFP (Fig. 4A).
Neurons were classified as spinal Prokr2^ChR2
neurons if they displayed reliable action po-
tentials with a short response latency (10 ms)
upon the delivery of a brief pulse of blue light
(Fig. 4B) (30). Notably, the response of Prokr2^ChR2
neurons was most vigorous when the hairy
Fig. 5. Conditional deletion of Prok2 in sensory neurons abolishes pleasant
touch sensation. (A) Double staining of Prok2 with various markers in DRG neurons.
Arrows indicate double-stained cells, and arrowheads indicate Prok2 only. Scale
bar, 20 μm. (B) Overlapping percentages of (A). Gray indicates double-stained ratio,
and blue indicates Prok2 only. (C) (Left) Schematic of targeting strategy for
generating Prok2 floxed (Prok2^F/F) mice, which were mated with Nav1.8^Cre mice to
generate Prok2^F/F;Nav1.8^Cre mice or Prok2 CKO mice. (Right) Gel electrophoresis of
genotyping polymerase chain reaction (PCR) from Prok2^F/F, wt/wt, and Prok2^F/wt
samples. (D) Expression of Prok2 in DRGs of the control (Prok2^F/F, WT) and Prok2
CKO mice (Prok2^F/F;Nav1.8^Cre, CKO). Scale bar, 20 μm. (E) Quantification of
(D). (F) A schematic of the experimental procedure. Parallel symbols indicate
that PT-CPP, heart rate, and hot plate tests were performed independently
after single-housing and home-cage stroking. (G and H) Representative
trajectory plots (G) and time spent (H) in stroking-paired chamber for WT
and CKO mice in the preconditioning baseline (Pre-c BL) and preference test
(Pref T). Pre-c BL versus Pref T: P < 0.01 for WT and P = 0.9552 for CKO mice.
(I and J) Heart rate test (I) and hot plate test (J) in WT and CKO mice. Heart rate,
BL versus poststroking: P < 0.01 for WT and P = 0.8737 for CKO mice; hot plate,
BL versus poststroking: P < 0.01 for WT and P = 0.1531 for CKO mice. (K to
O) Comparable latencies in Hargreaves test (K) and cold plate test (L), withdrawal
threshold in von Frey test (M), licking or flinching time induced by capsaicin (2 μg, i.pl.)
(N), and scratching numbers induced by chloroquine injection (200 μg, i.d.) (O)
between WT and CKO mice. n = 3 (E); n = 8 [(H) to (O)]. Statistics by unpaired t test
[(E) and (K) to (O)] or by two-way repeated measures ANOVA followed by Bonferroni's
multiple comparisons test [(H), (I), and (J)]. **P < 0.01; ***P < 0.001; n.s., not
significant. Error bars indicate SEMs.
skin of mice was brushed at a slowly moving
speed (18 to 22 cm/s); however, their firing 
became sluggish if the stroking speed was 
slower (2 to 3 cm/s) or faster (37 to 45 cm/s) 
(Fig. 4, C and D). The mean firing rates at 
three brushing speeds exhibited an inverted 
U shape (Fig. 4E), reminiscent of that of hu- 
man CT fibers (17, 18). This prompted us to 
examine whether other features of Prokr2^ChR2
neurons may resemble the hallmarks of hu-
man CT fibers (1, 16, 17, 19). One signature of
CT fibers is that they show no preference for
orientation in the receptive field (19). Prokr2^ChR2
neurons displayed comparable firing rates, 
irrespective of stroking directions (e.g., from 
rostral to caudal or from left to right; Fig. 4, 
F to H). Another distinctive feature is fa- 
tigue to the repetition of brush stroking, 
referring to gradually attenuated responses 
to repeated stroking stimuli within seconds 
(1, 16, 17, 19). This feature distinguishes un- 
myelinated CT fibers from myelinated low-
threshold mechanoreceptors (18) and has
also been observed in cutaneous C mecha-
noreceptor afferents in rats (31) and cats (32).
The firing rate of Prokr2^ChR2 neurons also
exhibited fatigue to repeated brush stroking 
and was reduced by 40.7 to 78.8% within 
10 series of successive brush stroking (2-s 
interval) (Fig. 4, I and J). Consistent with a 
recent study showing the presence of CT af- 
ferents in human glabrous skin (33), PROKR2 
neurons also responded to gentle stroking 
applied to the glabrous skin (fig. S5A), in- 
dicating the presence of PROK2 fibers in the 
glabrous skin. 
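
As an illustration of how the fatigue described above could be quantified, the following Python sketch expresses each stroke's firing rate as a percent reduction relative to the first stroke in a series. The function name `percent_reduction` and the example rates are hypothetical and are not taken from the study, whose analysis code is not given here.

```python
from typing import List, Sequence


def percent_reduction(rates_hz: Sequence[float]) -> List[float]:
    """Percent drop of each stroke's firing rate relative to the first stroke."""
    if not rates_hz:
        raise ValueError("need at least one firing rate")
    first = rates_hz[0]
    if first <= 0:
        raise ValueError("first-stroke firing rate must be positive")
    return [100.0 * (first - rate) / first for rate in rates_hz]


# Hypothetical firing rates (Hz) for 10 successive strokes; NOT data from the study.
example_rates = [20.0, 16.0, 13.0, 11.0, 10.0, 9.0, 8.0, 7.0, 6.0, 5.0]
for stroke_number, drop in enumerate(percent_reduction(example_rates), start=1):
    print(f"stroke {stroke_number}: {drop:.1f}% below first stroke")
```

Normalizing to the first stroke makes neurons with different baseline rates comparable, which is one plausible way to arrive at a range such as the 40.7 to 78.8% reported in the text.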


Fig. 6. Profound impairments of PROK2-PROKR2 mutant mice in stress response and
prosocial behaviors. (A to C) Stress and anxiety-like behavioral tests. Shown are
the percentages of time spent in the center zone of the open field apparatus (A),
time spent in the light-illuminated chamber of the light-dark box (B), and time
spent in the open quadrants of the elevated zero maze (C). WT versus ABL:
P = 0.2795 (A), P = 0.3514 (B), and P = 0.4312 (C); WT versus CKO: P < 0.05 (A),
P < 0.001 (B), and P < 0.01 (C). (D) The three-chamber social novelty test. The
preference index for the percentage of time spent exploring a novel mouse versus
a familiar mouse (left). (Right) Representative heatmaps of locomotor activity in
the chambers.
Lastly, we examined several stimuli other
than gentle stroking that could elicit the
response of CT fibers (16, 34). Consistently,
Prokr2^ChR2 neurons responded to punctate
stimulation (von Frey filament at 0.07 g or
0.7 mN), pinprick stimulation, and cooling
temperature (30° to 15°C) (fig. S5, B to D).
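
The opto-tagging criterion used in this section (reliable spiking at short latency after a brief light pulse) can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the 10-ms latency bound comes from the text, whereas the reliability threshold of 0.8, the function names, and the example spike times are hypothetical and are not taken from the study.

```python
from typing import Optional, Sequence


def first_spike_latency_ms(spike_times_ms: Sequence[float],
                           pulse_onset_ms: float) -> Optional[float]:
    """Latency from light onset to the first spike at or after onset (None if no spike)."""
    latencies = [t - pulse_onset_ms for t in spike_times_ms if t >= pulse_onset_ms]
    return min(latencies) if latencies else None


def is_opto_tagged(spike_times_ms: Sequence[float],
                   pulse_onsets_ms: Sequence[float],
                   max_latency_ms: float = 10.0,    # latency bound stated in the text
                   min_reliability: float = 0.8     # ASSUMED threshold, not from the study
                   ) -> bool:
    """True if enough light pulses are followed by a spike within max_latency_ms."""
    if not pulse_onsets_ms:
        raise ValueError("need at least one light pulse")
    hits = 0
    for onset in pulse_onsets_ms:
        latency = first_spike_latency_ms(spike_times_ms, onset)
        if latency is not None and latency <= max_latency_ms:
            hits += 1
    return hits / len(pulse_onsets_ms) >= min_reliability


# Hypothetical 1-Hz stimulation (onsets in ms) and spike times; NOT data from the study.
onsets = [0.0, 1000.0, 2000.0]
print(is_opto_tagged([4.0, 1005.0, 2003.0], onsets))   # short-latency unit
print(is_opto_tagged([50.0, 1060.0, 2003.0], onsets))  # mostly long-latency unit
```

Requiring both a latency bound and a minimum fraction of responsive trials separates directly light-driven units from units that fire only occasionally or through slower, indirect activation.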


Coding of pleasant touch by PROK2 in 
sensory neurons 


We next examined Prok2 expression in several 
types of dorsal root ganglion (DRG) neurons 
using RNAscope combined with IHC. Prok2 
partially overlaps to varying degrees with TRPV1 
and calcitonin gene-related peptide (CGRP), 
which are expressed in nociceptive neurons; 
IB4; NF200, a neurofilament expressed predom- 
inantly in large-diameter neurons; MrgprB4; 
and tyrosine hydroxylase (TH) expressed in some 
C-low-threshold mechanoreceptors (C-LTMRs) 
(35) (Fig. 5, A and B, and fig. S6F). Overall, 
~59.6% of DRG neurons express Prok2. To 
ascertain whether Prok2-expressing primary 
afferents form monosynaptic contacts with 
PROKR2 neurons, we performed rabies virus- 
mediated retrograde tracing in the spinal cord 
of Prokr2^Cre mice (25) (fig. S6, A and B). Ex-
amination of Prok2 expression with retrograde
transported glycoprotein (G)-deleted rabies 
virus (RVdG) that labels the input neurons 
with dsRed in DRGs revealed that ~70% of 
the input neurons were colabeled with Prok2 
(fig. S6, C to E). Only a small portion of CGRP 
fibers and minimal TRPV1 or TH fibers formed
direct contacts with PROKR2 neurons, whereas 
no MrgprB4 and IB4 afferents were found to 


form monosynaptic contacts with PROKR2 
neurons (fig. S6, C and D). However, it should 
be noted that a lack of inputs from TH or IB4 
afferents could be because of their resistance 
to rabies virus infection (36). Further, we show 
that the sizes of Prok2 input neurons normally 
are in the 200-to-400-μm² range, which indi-
cates that they are small DRG neurons (fig. S6E).

To determine the role of PROK2 in pleas- 
ant touch, we generated mice harboring a 
floxed allele of Prok2 using the gene targeting 
strategy (Fig. 5C). Floxed Prok2 mice were 
bred with Nav1.8^Cre mice, which express the
Cre recombinase in small nociceptive sensory
neurons, to delete Prok2 in Nav1.8 neurons
of DRGs (Fig. 5C). The expression of Prok2
was reduced by 52.8% without affecting IB4,
CGRP, or MrgprB4 in DRGs and IB4 or CGRP
central fibers in the lumbar cord (Fig. 5, C to
E, and fig. S7, A to D). Mice with conditional
knockout of Prok2 in DRGs, referred to as 
Prok2 CKO mice, failed to show PT-CPP (Fig. 5, 
F to H). By contrast, they showed normal pain 
and itch behaviors (Fig. 5, K to O). Further- 
more, Prok2 CKO mice did not show signifi- 
cant changes in the heart rate and thermal 
pain thresholds after gentle stroking (Fig. 5, 
I and J). CT and C-LTMRs are insensitive to
capsaicin, which activates TRPV1 fibers (37, 38). 
This prompted us to explore the role of TRPV1 
fibers in pleasant touch. Ablation of the cen- 
tral terminals of TRPV1 fibers with intra- 
thecal resiniferatoxin (RTX), a potent TRPV1 
agonist, abolished neurogenic pain elicited by 
capsaicin and markedly attenuated mechani-
cal itch that is dependent on Aβ-LTMR and
WT versus ABL: P < 0.05; WT versus CKO: P < 0.01. (E to G) The home-cage social grooming test. Cartoons show mouse allogrooming in the home cage (E). (F and G)
Allogrooming time for each pair. Groomer-groomee (F), WT-WT pair versus WT-ABL pair: *P < 0.05; WT-WT pair versus ABL-WT pair: ***P < 0.001. Groomer-groomee (G),
WT-WT pair versus WT-CKO pair: ***P < 0.001; WT-WT pair versus CKO-WT pair: ***P < 0.001. n = 8 [(A) to (D) and (G)]; n = 10 to 11 (F). Statistics by unpaired t test [(A) to
(D)] or one-way ANOVA followed by Bonferroni post hoc [(F) and (G)]. *P < 0.05; **P < 0.01; ***P < 0.001; n.s., not significant. Error bars indicate SEMs.
C fiber inputs and GRP and GRPR neurons
(22, 25) (fig. S8, A to D). Notably, RTX treat- 
ment had no effect on PT-CPP (fig. S8, E to G). 


Profound deficits in stress response and 
prosocial behaviors of PROK2 mutant mice 


Pleasant social touch affords tremendous emo- 
tional and psychological benefits by indirectly 
activating the endogenous reward-pleasure 
circuits to release a plethora of neuropeptides 
and neurotransmitters that encode antistress, 
positive hedonic, and prosocial value (1, 7, 39-42).
We sought to examine whether a loss of pleas- 
ant touch developmentally or in adult mice 
may result in abnormal stress responses. Prok2 
CKO mice spent significantly less time than 
their WT littermates in the center zone of the 
open-field apparatus, in the light-illuminated 
chamber of the light-dark box, and in the open 
quadrants of the elevated zero maze apparatus 
(Fig. 6, A to C). By contrast, the time that ABL 
mice spent in these areas did not differ from 
that of their WT littermates (Fig. 6, A to C). We 
further assessed social novelty recognition of 
mutant mice using the three-chamber social 
interaction test. Unlike WT mice, neither Prok2 
CKO nor ABL mice displayed a preference for 
a newly introduced mouse, demonstrating se- 
vere deficits in social novelty recognition (Fig. 
6D and fig. S9). 
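
For readers unfamiliar with preference indices in the three-chamber test, the Python sketch below shows one common way such an index can be computed from exploration times. The formula (difference over sum, scaled to percent), the function name `preference_index`, and the example durations are assumptions for illustration; the exact definition used in this study is not stated in the text above.

```python
def preference_index(time_novel_s: float, time_familiar_s: float) -> float:
    """Assumed index: 100 * (novel - familiar) / (novel + familiar).

    Positive values mean more time exploring the novel mouse, negative values
    mean more time with the familiar mouse, and 0 means no preference.
    """
    if time_novel_s < 0 or time_familiar_s < 0:
        raise ValueError("exploration times must be non-negative")
    total = time_novel_s + time_familiar_s
    if total == 0:
        raise ValueError("total exploration time must be positive")
    return 100.0 * (time_novel_s - time_familiar_s) / total


# Hypothetical exploration times in seconds; NOT data from the study.
print(preference_index(150.0, 50.0))  # clear preference for the novel mouse
print(preference_index(60.0, 60.0))   # no preference
```

Under this assumed definition, an animal lacking social novelty recognition would be expected to score near zero.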

Finally, we evaluated whether Prok2 CKO or 
ABL mice might have deficits in social touch 
behavior. To examine this, we first monitored 
spontaneous social interactions between paired
adult C57BL/6J mice in the home cage, which
provides a more naturalistic environment,
and observed frequent social or allogrooming
behaviors unrelated to sexual or aggressive
and/or conflict activity (Fig. 6E and movie S5).
In sharp contrast to the paired WT-WT con-
specifics, social grooming of WT mice toward
ABL mice was significantly diminished (Fig. 6F). 
Unexpectedly, ABL mice rarely groomed WT 
conspecifics capable of sensing pleasant touch 
(Fig. 6F). These deficits were even more pro- 
nounced in the WT-Prok2 CKO pairs (Fig. 6G). 


Discussion 


Using an interdisciplinary approach coupled 
with autonomic, neuroendocrine, electrophysio- 
logical, and behavioral criteria, our study 
demonstrates the crucial function of the 
PROK2-PROKR2 signaling pathway in pleasant 
touch. The profound loss of pleasant touch 
sensation in Prok2 CKO mice underscores the 
pivotal role of PROK2 in the coding and trans- 
mitting of pleasant touch information. Our 
work reveals two parallel peptidergic path- 
ways, distinguished by their capsaicin sensi- 
tivity, from the skin to the spinal cord: One 
conveys positive valence to PROKR2 inter- 
neurons, which constitutes an obligate circuit 
for sending pleasant touch to the brain via 
Gpr83 neurons, and the other conveys nega-
tive valence to the brain through spinal pain-
and itch-specific local microcircuits and NK1R
projection neurons (20, 24, 43). We argue that
PROK2 in Aβ and Aδ fibers is unlikely to be
involved in pleasant touch because PROKR2
neurons receive no direct inputs from these
fibers. Moreover, unlike unmyelinated C fibers
that encode the specificity of sensory modalities
through slow-acting neuropeptides, myelinated
Aβ and Aδ fibers typically use fast-acting
glutamate as a neurotransmitter to relay in-
formation (22). We establish the function of
PROK2 in encoding affective properties of gentle
stroking; however, understanding how Aβ and
Aδ LTMR fibers contribute to the transmission
of discriminative properties of gentle touch 
and stroking will require further studies. 
Although PROK2 fibers that synapse with 
PROKR2 neurons could be the equivalent of 
human CT fibers, it is not feasible to examine 
their conduction velocities or cutaneous in- 
nervation patterns in isolation. Nonetheless, 
notable neurophysiological features shared 
by PROKR2 neurons and human CT fibers 
indicate that a subset of PROK2 fibers is 
equivalent to CT fibers. The response of 
PROKR2 neurons to cooling raises the pos- 
sibility that PROKR2 neurons may be a con- 
vergent node for integrating different kinds 
of cutaneous information that encodes pos- 
itive hedonic valence (e.g., pleasantness of 
cooling), akin to GRPR neurons subserving as 
a convergent node for mechanical and chem- 
ical itch (25). These data further support the 
neuropeptide code model that somatosensory 
modalities with slow response kinetics are 
encoded by neuropeptides in sensory neurons 
and conveyed by respective spinal microcircuits 
that can be defined and identifiable through 
specific GPCR expression (22). The development 
of the PT-CPP test that entails inference of 
pleasant touch in an unbiased manner over- 
comes a major obstacle in the interrogation 
of molecular underpinnings and neural circuits 
of pleasant touch. Together with the ethologically 
relevant seminaturalistic social grooming para- 
digm, we provide an avenue to unravel mecha- 
nisms by which the need for affective touch 
drives social attachment and affiliative behaviors. 

This study has important clinical implica- 
tions. The heightened stress and anxiety-like 
behaviors of Prok2 CKO but not ABL mice 
are attributable to a lack of pleasant touch 
during a critical period in development, which 
supports prior studies showing that early tac- 
tile experience is more instrumental in shap- 
ing the resilience of offspring against stressful 
events (44, 45). Accordingly, ABL mice are less 
vulnerable to stress. In light of the challenge in 
the study of the developmental role of pleasant 
touch because of the multisensory nature of 
parental care (46), Prok2 CKO mice might 
serve as an invaluable animal model for as- 
sessment of the long-term effects of depriva- 


tion of maternal or caregiving nurturing touch 
on offspring's development and health (47).
A dearth of pleasant touch sensation could 
dysregulate prosocial neuropeptide expres- 
sion in the brain and thereby impede social 
recognition and interactions. The failure of 
mutant mice to recognize unfamiliar con- 
specifics indicates a crucial role of pleasant 
touch in social recognition and social memory 
known to be important for social bonding (48). 
Additionally, the marked avoidance of the 
social touch phenotype is reminiscent of some 
early traits of ASD (10, 14, 49). Further analysis 
of prosocial behavioral impairment in Prok2 
CKO mice may offer additional insights into 
the etiology of certain neurodevelopmental and 
affective disorders that hinder social interac- 
tions and affiliative behaviors. The inability 
of mutant mice to groom WT conspecifics 
stresses the importance of PROK2 signaling 
in mediating synchronous and bidirectional 
communication of mutually beneficial social 
information through reciprocal tactile con- 
tacts (6). Finally, the observation that the firing 
features of PROKR2 neurons recapitulate the 
hallmarks of human CT fibers reinforces the 
notion that neural mechanisms of pleasant 
touch are conserved between humans and 
rodents. Conceivably, a deficiency of PROK2- 
PROKR2 signaling might result in social and 
emotional impairments that could lead to so- 
cial isolation, anxiety, and mental disorders. 
ZEOLITES


In situ imaging of the sorption-induced subcell
topological flexibility of a rigid zeolite framework


Hao Xiong†, Zhiqiang Liu†, Xiao Chen, Huiqiu Wang, Weizhong Qian, Chenxi Zhang,
Anmin Zheng, Fei Wei*


The crystallographic pore sizes of zeolites are substantially smaller than those inferred from catalytic 
transformation and molecular sieving capabilities, which reflects flexible variation in zeolite opening 
pores. Using in situ electron microscopy, we imaged the straight channels of ZSM-5 zeolite with benzene 
as a probe molecule and observed subcell flexibility of the framework. The opening pores stretched 
along the longest direction of confined benzene molecules with a maximum aspect change of 15%, and 
the Pnma space group symmetry of the MFI framework caused adjacent channels to deform. This 
compensation maintained the stability and rigidity of the overall unit cell within 0.5% deformation. 
The subcell flexibility originates mainly from the topologically soft silicon-oxygen-silicon hinges between
rigid tetrahedral SiO4 units, with inner angles varying from 135° to 153°, as confirmed by ab initio
molecular dynamics simulations.


Zeolite pores, which have diameters ranging
from ~0.3 to ~1.3 nm, dictate their
molecular sieving properties and control 
access to internal sites that are catalytically 
active or preferred for binding by sorbates 
(1-4). By adjusting the size and shape of pore 
openings, the size and shape of adsorbing 
molecules that fit inside the pores can be 
selected, thereby excluding larger molecules. 
This effect is exploited in selective chemical 
conversion (4-9). However, the effective pore 
sizes calculated from the crystallographic structures
are substantially smaller than those inferred
from catalytic transformation and molecular
sieving capabilities (8-10). It has long been 
speculated that this discrepancy reflects in part 
a flexible deformation of zeolite pores. 

Nonetheless, the flexible deformation of 
zeolite pores is rarely reported. Unlike flexible 
metal-organic frameworks (MOFs) with long 
and soft organic linkers (11-13), zeolite mate- 
rials (with an elastic modulus in the range of 
50 to 100 GPa) macroscopically behave as 
rigid and fragile materials in most applications 
(14-16). Findings related to zeolite flexibility 
have included minor variations in cell param- 
eters (<10 pm) and a symmetry transformation 
from monoclinic to orthorhombic as a function 
of temperature and sorbates (17-23). 

Direct measurement of pore sizes is, in 
principle, possible by diffraction, such as x-ray 
diffraction and electron diffraction. Nonetheless, 
the derived Debye-Waller factors for zeolites 
usually compensate for the complexity of zeolite 
crystal structures and various other model 
deficiencies, including inhomogeneity effects. 


Furthermore, averaged cell-scale results can- 
not indicate the local structure of deformed 
opening pores caused by successive twisted 
tetrahedrons. Crystal dynamics simulation 
techniques have been applied to trace the 
motion and vibration of specific atoms within 
zeolite frameworks, and some progress toward 
describing framework flexibility has been made, 
but supporting experimental evidence has been 
lacking (2, 9, 24-26). Thus, new experimental 
technology with ultrahigh spatial resolution 
is needed to reveal the subcell local structural 
evolution of the zeolite framework when guest 
molecules break through the pore size limit, 
especially in practical applications. 

Integrated differential phase contrast scan- 
ning transmission electron microscopy (iDPC- 
STEM) can reveal the local structures of zeolites 
and image the confined molecules inside them 
in real space at an atomic scale (27-30) be- 
cause of its excellent capability for imaging 
light and heavy elements together (31, 32).
Here, we combined iDPC-STEM imaging with 
an in situ atmosphere system to monitor the 
molecular phase transition and corresponding 
geometric variations of opening pores during 
the benzene adsorption-desorption process in 
real time. We studied the straight channel 
(5.3 Å × 5.6 Å) of ZSM-5 (MFI-type) zeolite as
the imaging window and used benzene, which
has a kinetic diameter of 5.85 Å, as a probe
molecule. We observed the phase transition of 
confined benzene molecules and resolved the 
varying atomic structure of the MFI frame- 
work. Thus, we successfully observed the subcell 
flexibility of the zeolite framework, investigated 
the local deformation of zeolite channels, and 
monitored the dynamic evolution process 
when guest molecules enter or exit the zeolite 
framework. The results will enable us to better 
understand the topologically flexible structural 
characteristics of zeolites and the intrinsic 
mechanism of molecular diffusion in micro- 
porous materials. 
Fig. 1. Imaging the huge local deformation of zeolite channels in the in situ
STEM system during benzene adsorption. (A) Schematic of the in situ STEM
experimental setup. The benzene adsorption-desorption process was carried
out in an enclosed environment chamber made of Si3N4 solid membranes in an
atmosphere system. (B to D) iDPC-STEM images and profile analysis of an empty
MFI zeolite specimen from the [010] projection. The magnified image shows a
nearly round pore. The profile analysis gives the aspect ratio of Dmax to Dmin,
which was used to estimate the local deformation of straight channels. (E to G)
iDPC-STEM images and profile analysis of a saturated benzene@MFI specimen
from the [010] projection at 473 K and 450 torr (90% N2 and 10% benzene). The
10-ring opening pore shows a distinctly elliptical shape stretched along the
parallel direction of the benzene plane. (H to J) Further comparison of imaging
differences between the empty (H) and saturated (I) states of benzene@MFI,
showing clearer imaging of O atoms in the MFI framework, which are highlighted
in (J). Forty opening pores were measured to obtain each statistical result. Scale
bars, 2 nm [(B) and (E)], 500 pm [(C), (F), (H), and (I)].


In situ observation of subcell deformation in 
zeolite channels 

ZSM-5 zeolites are composed of corner-sharing
tetrahedral TO4 (T: Si, Al) and have a cross-linked
channel system formed of straight
channels (5.3 Å × 5.6 Å) and sinusoidal channels
(5.1 Å × 5.5 Å) (fig. S1). The straight
channels of MFI zeolite can be directly imaged
from the [010] projection at the atomic scale
and are the best imaging windows for observing
opening pores and confined molecules
(fig. S2). Inspired by the synthesis strategy
for nanosized lamellar architectures (8, 33-37),
we synthesized coffin-shaped MFI single crystals
that had a smooth surface and uniform
thickness (~40 nm) along the b axis (figs. S3
and S4). By eliminating the influence of height
variance on STEM imaging, we could observe
in situ both adsorbed benzene molecules and
the zeolite framework in a large field of view
(25.74 nm × 25.74 nm).

Figure 1A shows a schematic of the in
situ STEM experimental setup. The experiments
were performed in a spherical aberration (Cs)-corrected
transmission electron microscope
equipped with an enclosed environment chamber
where environmental temperature, inlet
gas composition, and volume flow could be
regulated precisely. Full experimental details
are given in the supplementary materials. Be- 
fore benzene adsorption, the fresh MFI zeolite 
specimen was degassed under vacuum at 923 
K for 5 hours. From the [010] projection, the 
projected positions of Si atoms and empty 
straight channels were imaged (Fig. 1B); the 
O atoms were covered by nearby Si atoms 
because of thermal diffuse scattering and 
lattice vibrations. 

To better describe the varying geometries
of straight channels, we defined the distances
between two pairs of opposing Si atoms in the
10-ring as the long axis (Dmax, red line) and
short axis (Dmin, blue line), respectively, and
used them as the periodic parameters for
straight channels (Fig. 1, C and F, and fig. S5).
This definition differs slightly from the usual
definition of the pore size of straight channels,
which is based on the distance between O atoms (38).
The magnified image of empty straight channels
showed a nearly round pore shape
(Dmax, 9.14 ± 0.11 Å; Dmin, 8.91 ± 0.16 Å)
with an aspect ratio of 1.02 (Fig. 1D and fig. S6),
which is highly consistent with
the standard structural model (fig. S5). This
symmetrical and uniform shape of opening
pores originates from high crystallographic symmetry
and isostatic, perfectly regular tetrahedra,
indicating no internal stress accumulation.
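
As a rough illustration of how an aspect ratio and its uncertainty follow from the two measured axes, the sketch below divides Dmax by Dmin and propagates the quoted standard deviations to first order. The function name and the assumption that the two uncertainties are independent are ours, not the authors'.

```python
import math

def aspect_ratio(d_max, d_min, s_max=0.0, s_min=0.0):
    """Return (ratio, sigma) for d_max / d_min.

    Sigma uses first-order error propagation and assumes the two
    uncertainties are independent (an assumption of this sketch).
    """
    ratio = d_max / d_min
    relative_error = math.hypot(s_max / d_max, s_min / d_min)
    return ratio, ratio * relative_error

# Empty straight channel, values in angstroms as quoted in the text.
ratio, sigma = aspect_ratio(9.14, 8.91, 0.11, 0.16)
print(f"aspect ratio = {ratio:.3f} +/- {sigma:.3f}")
```

With the rounded mean values quoted above, the ratio comes out at roughly 1.03, which agrees with the reported 1.02 to within the propagated uncertainty.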


After benzene adsorption at 473 K and 450
torr (10% benzene and 90% N2), the close size
match of one C6-ring plane and the channels
of ZSM-5 zeolite caused the benzene molecules
to be stably confined in straight and sinusoidal
channels through van der Waals interactions
(39, 40). Following the principle of energy optimization,
the degrees of freedom of adsorbed
molecules would be reduced to several states
(28, 29). Thus, the confined benzene molecules
were “frozen” in zeolite channels with a fixed
orientation and critical dimensions
(7.367 Å × 6.702 Å × 3.40 Å) (41, 42) that were different
from the kinetic diameter in the gas phase,
where the molecule is assumed to be a sphere (7).

Fig. 2. Dynamic evolution of zeolite channels and corresponding
host-guest interactions in benzene desorption. (A to F) iDPC-STEM
snapshots of a benzene@MFI specimen along the [010] projection at different
stages during the cyclic benzene desorption process. Benzene was taken
up at 473 K and 450 torr (10% benzene and 90% N2) and released under
vacuum; the saturated state is denoted as 0 min. After ~80 min, the
temperature was increased to 923 K to release the remaining benzene
molecules. (G) Magnified iDPC-STEM images of straight channels extracted
from (A) to (F) (marked by the corresponding colored boxes), indicating the
changes in benzene contrast and channel geometry at different stages.
(H) Evolution of the normalized benzene/zeolite framework contrast ratio
and corresponding aspect ratios of Si10 opening pores during the in situ
benzene uptake and release process. The pink stars indicate points where
the aspect ratios of Si10 rings are correlated with the imaging contrast
of confined benzene molecules, which reached 1.15 in the saturated state.
Fifty opening pores were measured to obtain the error bars at each point.
(I) Statistical distribution of benzene orientations at different stages of
benzene desorption. Scale bars, 2 nm [(A) to (F)], 500 pm (G).

The total overlapping projection of these
benzene molecules formed an elliptical spindle
with the smallest cross-sectional area of
benzene in each straight channel (Fig. 1E). The
unified orientation and superimposed contrast
along the [010] projection enabled us to image
small molecules and the zeolite framework
simultaneously. Moreover, we observed
substantial local deformation of the
straight channels, from an approximately round
shape to a clearly elliptical shape (Fig. 1, F and
G). The straight channels were stretched along
Dmax (9.73 ± 0.16 Å), which was parallel to
the C6 plane of benzene molecules, and compressed
along Dmin (8.31 ± 0.12 Å), with an
aspect ratio of 1.176 (Fig. 1F). The difference
between Dmax and Dmin reached 15%. The maximum
pore diameter marked by Si atoms was
increased by 0.6 Å from the [010] projection
to match the critical dimension (6.702 Å) of
benzene molecules, determined from the
smallest cross section. This conclusion was
further confirmed in the iDPC imaging results
with different scan rotation angles (figs. S7
and S8), which indicates that the effect of scan
distortion could be neglected.
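
The numbers in this paragraph can be cross-checked with a few lines of arithmetic. The sketch below takes the mean axis lengths quoted for the empty and saturated channels and computes the saturated aspect ratio, the difference between the two axes, and the stretch of the long axis. The names are illustrative, and expressing the axis difference relative to the long axis is an assumption of this sketch, since the text does not state the normalization.

```python
# Mean Si-Si axis lengths in angstroms, as quoted in the text.
EMPTY = {"d_max": 9.14, "d_min": 8.91}
SATURATED = {"d_max": 9.73, "d_min": 8.31}

def deformation_summary(empty, loaded):
    """Compare a loaded channel with the empty reference channel."""
    ratio = loaded["d_max"] / loaded["d_min"]
    # Difference between the two axes, relative to the long axis
    # (the normalization is an assumption of this sketch).
    axis_gap = (loaded["d_max"] - loaded["d_min"]) / loaded["d_max"]
    # Elongation of the long axis relative to the empty pore.
    stretch = loaded["d_max"] - empty["d_max"]
    return {"aspect_ratio": ratio, "axis_gap": axis_gap, "stretch": stretch}

summary = deformation_summary(EMPTY, SATURATED)
print(f"aspect ratio : {summary['aspect_ratio']:.2f}")
print(f"axis gap     : {summary['axis_gap']:.1%}")
print(f"D_max stretch: {summary['stretch']:.2f} A")
```

The results come out at an aspect ratio of about 1.17, an axis difference of roughly 15%, and a long-axis stretch of about 0.6 Å, in line with the reported values.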

This geometrical match further reduced the
freedom of molecular translation and rotation
and constrained confined benzene molecules
to adopt nearly identical configurations, where
the C6 planes were parallel to Dmax. This local
deformation of zeolite opening pores suggested
that the zeolite framework would change as
needed to adapt to the entrance or exit of guest
molecules, which is the core of the diffusion or
formation of larger guest molecules breaking
through the crystallographic pore size limit.
Despite such severe local deformation, the
overall crystal structure of ZSM-5 zeolite remained
intact, and the unit cell parameters
were almost unchanged, mainly due to the symmetry
of the MFI framework (see below).

We achieved better imaging of O atoms in
the zeolite framework during benzene adsorption.
Given the 40-nm thickness of the specimen,
thermal diffuse scattering and multiple electron
scattering could not be ignored and inevitably
hampered the imaging resolution (43, 44).
Thus, we could only observe Si atoms on the
zeolite framework before benzene adsorption,
and the O atoms were blurred by nearby Si
atoms (Fig. 1H). However, after benzene adsorption,
we observed framework O atoms
(Fig. 1I). The observed O atoms are highlighted
with orange circles in the corresponding crystal
structure schematic (Fig. 1J); Fig. 1, H and I, were
cropped from fig. S9. We speculated that this
difference in O imaging occurred for two reasons.
First, when interacting with confined benzene
molecules, the flexible framework was tightened,
and the vibration of O atoms in the
zeolite framework was restrained, especially
in the Si10 rings. Second, the straight channels
were stretched along Dmax and compressed
along Dmin, which increased the
distance between adjacent Si-O-Si units as seen from the
[010] projection.


Dynamic imaging of the benzene 
desorption process 


To further elucidate the host-guest interaction 
between guest molecules and the zeolite frame- 
work, we performed an in situ desorption ex- 
periment to investigate the dynamic evolution 
process of zeolite channels. Real-time iDPC- 
STEM imaging results at different stages are 
shown in Fig. 2. Before benzene adsorption, 
the specimen was also degassed as described 
above. The holder temperature was decreased 
to 473 K and stabilized for 30 min. We then 
introduced the gas mixture (10% benzene 
and 90% N2, 450 torr). After 1 min, which was
shorter than the time required to obtain a 
single iDPC-STEM image, the zeolite frame- 
work was almost completely filled with ben- 
zene (fig. S10). Thus, it was difficult to observe 
the dynamic details of the adsorption process. 
In contrast, the desorption of benzene molecules
was a slow process that could be observed with
iDPC-STEM imaging. To quantitatively describe
the benzene desorption process, we normalized
the contrast of benzene spots in different
images according to the contrast of the
zeolite framework in the related profile analysis,
given that we had been observing the
same area (fig. S11).

We recorded the time at which the zeolite
framework reached adsorption saturation as
time zero; at the same time, we kept the temperature
constant and switched the atmosphere
to vacuum. Figure 2, A to F, shows
real-time images of straight channels throughout
the desorption process. As benzene was
released, the contrast of the benzene spots in
straight channels continuously decreased (initially
fast and then more slowly). Most benzene
molecules were desorbed within the first 40 min
(Fig. 2H). Finally, the remaining benzene molecules
were desorbed after heating to 923 K. Furthermore,
we compared the in situ STEM results
with the thermal gravimetric analysis (TGA)
results (fig. S12). The initial decline rate of the
TGA desorption process was faster because of
the contribution of benzene molecules adsorbed
on the outer surface. Overall, most of
the benzene was desorbed over a similar time
in both measurements.

As benzene was released, the channel geometry
changed with the number of adsorbed
benzene molecules. With the decrease in the
contrast of benzene spots, the degree of channel
deformation (described as the aspect ratio
Dmax/Dmin) gradually decreased, and the
shape returned from an ellipse to an approximately
round shape (Fig. 2G). The deformation
degree of the straight channels was positively
correlated with the number of adsorbed benzene
molecules in the straight channels; this
was further confirmed in saturated ZSM-5
zeolites at different temperatures (fig. S13).
This relation indicated that the host-guest
interaction was related to the number of
adsorbed molecules and would further affect
the shape of pores.

Conversely, the geometry of the opening
pores also affected the arrangement and orientations
of confined benzene molecules (Fig. 2I).
With benzene desorption, as the pore
shapes came to resemble ellipses with a
smaller curvature, the dominant configuration
of benzene molecules changed in response
(from 52° to 46°) and the angle distribution
became wider, indicating that a weakened host-guest
interaction or confinement facilitated
molecular vibration and rotation. Thus, as the
adsorbed benzene molecules deformed the
opening pores, the deformed geometry of
the zeolite pores dictated the dominant configurations
of guest molecules in return. In
this regard, the observed alterable geometry
of zeolite channels indicated that host-guest
interactions in flexible frameworks are not
immutable. To better understand the different
predominant pathways at different loadings,
apart from the interaction between guest
molecules, the variation in geometry of the
opening pores needs to be considered when
we describe the entropy and enthalpy of adsorption
(45, 46).

Fig. 3. Investigating the chemical nature of MFI framework flexibility
via ab initio molecular dynamics simulation. (A) Distributions of pore
size for benzene (C6H6) molecule adsorption with different loadings as
calculated from AIMD simulations at 473 K. (B to D) Pore size evolution
with time for 0 (B), 1 (C), and 4 (D) C6H6 molecules adsorbed inside MFI
zeolite. (E) Schematic diagram of the tetrahedral SiO4 linkage with three
structural defining parameters. (F to H) Statistical results for the Si-O-Si
angle (F), O-Si-O angle (G), and Si-O bond length (H) indicating a loose
network composed of rigid tetrahedral SiO4 and soft hinges at the
oxygen atoms. In the statistics of each parameter, the corresponding
number of variables is listed in the inserted schematic. See figs. S14 to S17
for details. Time step: 0.5 fs.
Ab initio molecular dynamics simulations at
different loadings 

To gain more insight into the subcell defor- 
mation of the ZSM-5 zeolites, we performed ab 
initio molecular dynamics (AIMD) simulations 
and calculated the distributions of the cor- 
responding lengths of both axes (Fig. 3, A to 
D). AIMD simulation is a powerful tool for 
in situ tracking of the dynamic changes of 
chemical bonds (such as bond lengths and 
angles) for zeolite catalysts at the picosecond 
scale under working conditions (9, 25). Both
the maximal (Dmax) and minimal (Dmin) pore
sizes of straight channels (Fig. 3A) with adsorbed
benzene molecules of various loadings
were determined at the experimental temperature
(473 K). The STEM imaging mode obtains
images by scanning point by point,
so forming an image often took several minutes,
which is much longer than the time scale
of the atomic thermal motion in the framework.
AIMD simulations, by contrast, can provide
more detailed information on zeolite
framework thermal motion at a picosecond
time scale.

In the absence of adsorbed benzene molecules,
the distributions of Dmax and Dmin were
similar, with peaks located at ~9.1 and ~9.0 Å,
respectively. Tracing the variances in pore
size over time revealed that the pore channels
were highly flexible and tended to be
circular statistically, but that there were
alternating shifts of Dmax and Dmin (Fig. 3B)
suggestive of a thermally activated breathing
motion of the zeolite pores.
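
To make the idea of summarizing such a trajectory concrete, the sketch below computes the mean and spread of two pore-axis time series and the fraction of frames in which the nominally shorter axis is the longer one. The series are a toy anti-phase oscillation with hypothetical amplitude and period; they stand in for simulation output and are not taken from the AIMD results.

```python
import math
import statistics

def summarize(series):
    """Mean and sample standard deviation of a pore-size time series."""
    return statistics.mean(series), statistics.stdev(series)

# Toy anti-phase oscillation standing in for AIMD output (hypothetical
# amplitude and period; real trajectories would come from the simulation).
steps = range(2000)
d_long = [9.1 + 0.3 * math.sin(2 * math.pi * t / 400) for t in steps]
d_short = [9.0 - 0.3 * math.sin(2 * math.pi * t / 400) for t in steps]

for name, series in (("first axis", d_long), ("second axis", d_short)):
    mean, spread = summarize(series)
    print(f"{name}: mean = {mean:.2f} A, std = {spread:.2f} A")

# Fraction of frames in which the nominally shorter axis is the longer one.
swapped = sum(s > l for l, s in zip(d_long, d_short)) / len(d_long)
print(f"axes swapped in {swapped:.0%} of frames")
```

A pore that is circular on average but breathes in this way shows nearly equal means for the two axes together with a sizable swapped fraction, which is the qualitative signature described in the text.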
Fig. 4. Pnma space group symmetry of MFI zeolite brings subcell topological flexibility and overall
rigidity. (A to C) iDPC images of a straight channel before and after benzene adsorption show severe
local deformation, with a high aspect ratio of 1.17. (D and E) iDPC images of the MFI framework before
and after benzene adsorption from the [010] projection with nearly identical cell parameters, from the
statistical data of 10 x 10 cells. (F and G) Corresponding structures of the MFI framework before (F)
and after (G) benzene adsorption in [010] projection. The yellow ellipses present the shapes of straight
channels. Scale bars, 500 pm [(A) and (B)], 2 nm [(D) and (E)].


However, as more benzene was adsorbed,
the flexibility of the pore channels decreased,
and the shapes tended to resemble an ellipse
with a greater curvature (Fig. 3, C and D). For
example, when one benzene molecule was
adsorbed per unit cell, the mean Dmax and
Dmin were 9.6 and 8.6 Å, respectively. The
deviation between Dmax and Dmin increased
(to ~9.9 and 8.3 Å, respectively) when four
benzene molecules were confined inside each
MFI zeolite unit cell. Moreover, the standard
deviation of the pore size distribution tended to
decrease as more benzene adsorbed, indicating
decreasing flexibility in the Si10 opening pores
(Fig. 3A and table S1). Overall, as the amount
of adsorbed benzene increased, the deformation
of MFI straight channels increased, which
was consistent with our experimental images.
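
The trend with loading can be read directly off the mean values quoted above. The sketch below converts them to aspect ratios and checks that the ratio rises with the number of benzene molecules per unit cell; the quoted means are approximate, and the helper names are illustrative.

```python
# Mean pore axes (angstroms) from the AIMD results quoted in the text,
# keyed by the number of benzene molecules per unit cell.
AIMD_MEANS = {0: (9.1, 9.0), 1: (9.6, 8.6), 4: (9.9, 8.3)}

def aspect_ratios(means):
    """Map each loading to D_max / D_min."""
    return {n: d_max / d_min for n, (d_max, d_min) in means.items()}

def is_monotonic_increasing(values):
    """True if each value is strictly larger than the one before it."""
    return all(a < b for a, b in zip(values, values[1:]))

ratios = aspect_ratios(AIMD_MEANS)
ordered = [ratios[n] for n in sorted(ratios)]
for n in sorted(ratios):
    print(f"{n} benzene per cell: aspect ratio {ratios[n]:.2f}")
print("deformation grows with loading:", is_monotonic_increasing(ordered))
```

The ratio at four molecules per cell is close to the value of about 1.17 measured in the saturated specimen.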

Structurally, zeolites are composed of corner-sharing,
nearly perfect tetrahedral TO4 units connected
by hinges at the oxygen atoms (fig. S1).
Hence, the different structures of zeolites are
mainly determined by three structural parameters:
α (Si-O-Si bond angle), β (O-Si-O bond
angle), and d (Si-O bond length) (Fig. 3E). To
further investigate the origin of MFI zeolite
flexibility, we performed detailed statistical
analyses on the bond lengths and angles in
the 10-membered rings (10MRs). With benzene
adsorption, the main contribution to channel
deformation came from the increase in α (Fig. 3F
and fig. S15), whereas β and d within tetrahedral
SiO4 were almost unchanged (Fig. 3, G and H,
and figs. S16 and S17). The most substantial
change came from the second and seventh α,
which were exactly perpendicular to the plane of the
benzene molecule (Fig. 3F). The local stress
was mainly concentrated in these two Si-O-Si
bonds and nearby areas, causing the Si-O-Si
angle to be flattened and distant in the [010]
projection (fig. S18).
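
For readers who want to extract these parameters from atomic coordinates, the sketch below computes a bond angle and a bond length from three positions. It is a generic geometry helper, not the authors' analysis code, and the coordinates are hypothetical values chosen only to illustrate the calculation.

```python
import math

def bond_angle(a, b, c):
    """Angle a-b-c in degrees, with b at the vertex (e.g. Si-O-Si at O)."""
    v1 = [p - q for p, q in zip(a, b)]
    v2 = [p - q for p, q in zip(c, b)]
    dot = sum(x * y for x, y in zip(v1, v2))
    norm = math.dist(a, b) * math.dist(c, b)
    # Clamp to guard against rounding slightly outside [-1, 1].
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))

def bond_length(a, b):
    """Distance between two atoms, in the units of the coordinates."""
    return math.dist(a, b)

# Hypothetical coordinates (angstroms) for one Si-O-Si hinge; they are
# not taken from the simulations.
si_1 = (-1.5, 0.0, 0.0)
o = (0.0, 0.6, 0.0)
si_2 = (1.5, 0.0, 0.0)

print(f"Si-O-Si angle = {bond_angle(si_1, o, si_2):.1f} deg")
print(f"Si-O length   = {bond_length(si_1, o):.2f} A")
```

Applying the same two functions to every hinge and tetrahedron in a ring, frame by frame, would yield the kind of angle and bond-length distributions summarized in Fig. 3, F to H.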

Previous studies have found that thermodynamics
and kinetics are sensitive to differences
in the local chemical environment,
especially the T-O-T angle at the acid site, which
can be stretched to influence the intrinsic
acidity, stabilize the protonated transition
state, and thereby control the reaction pathway
in ZSM-5 (47, 48). This sensitivity has a
profound impact on understanding the molecular
reaction mechanism. In contrast, the distributions
of β and d at different loadings were
nearly identical and near the theoretical value
of a perfect SiO4 tetrahedron (109° and 1.66 Å)
(Fig. 3, G and H). We further examined the
distributions of six O-Si-O angles within a single
tetrahedral SiO4 in a 10MR straight channel to
confirm the rigidity of the SiO4 tetrahedron (fig.
S19). Thus, we concluded that the local flexibility
of the zeolite framework originated from the
topologically soft Si-O-Si hinges between adjacent
tetrahedral SiO4, whereas the tetrahedral
SiO4 remained rigid and nearly perfect.


MFI zeolite framework with subcell topological 
flexibility and overall rigidity 


The above structural analysis and theoretical 
calculation results revealed the local flexibility 
in the MFI framework, which allows extensive 
local deformation of the straight channels to 
accommodate larger guest molecules. When 
benzene and other organic sorbates, with a 
comparable molecular size to the opening pores, 
adsorb from the gas phase into the pore- 
confined phase, the molecules fall into a deep 
potential well. The movement and vibrations of 
confined molecules are restricted, and the images 
of confined molecules resemble their molecu- 
lar dimensions, behaving as one-dimensional 
oriented single crystals, such that the mole- 
cules cannot simply be treated as spheres. Thus, 
the opening pores will be stretched according 
to the molecular dimensions of the confined 
molecules, and the maximum change in pore 
diameter can reach 0.63 Å in the [010] projection
(Fig. 4, A to C, and figs. S20 and S21).

Despite severe subcell deformation, mono- 
crystalline MFI zeolite could still maintain 
an intact crystal structure, and the unit cell 
parameters from the [010] projection were 
basically unchanged (Fig. 4, D and E). Here, 
we used profile analysis to study the unit cell 
parameters (a, c) and their included angle 
from the [010] projection. We only saw slight
variance in the overall cell parameters (from
2.033 ± 0.002 nm × 1.364 ± 0.001 nm × 90.5° to
2.043 ± 0.002 nm × 1.365 ± 0.001 nm × 90.1°).
The changes in cell size and included angle 
were < 0.5%, indicating little structural change 
in the bulk material, which was consistent 
with previous work (8, 20, 23). 
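
The claim that every change stays below 0.5% is easy to verify from the quoted cell parameters. In the sketch below, the parameter labels and the helper name are illustrative, and the final included angle is read as 90.1°, which is an assumption of this sketch.

```python
def relative_change(before, after):
    """Fractional change of a parameter, relative to its initial value."""
    return abs(after - before) / before

# (before, after) values from the profile analysis quoted in the text.
# The final angle is taken as 90.1 degrees, an assumption of this sketch.
CELL = {
    "a (nm)": (2.033, 2.043),
    "c (nm)": (1.364, 1.365),
    "angle (deg)": (90.5, 90.1),
}

changes = {name: relative_change(b, a) for name, (b, a) in CELL.items()}
for name, change in changes.items():
    print(f"{name}: {change:.2%}")
print("all below 0.5%:", all(c < 0.005 for c in changes.values()))
```

The largest change is in the a parameter, at just under 0.5%, while the local pore axes change by several percent, which is the contrast between subcell and whole-cell behavior that the section describes.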

This considerable difference between local
and global structural changes may originate
in the symmetry of the zeolite framework.
Although some of the TO4 tetrahedra were
twisted by host-guest interactions to form local
deformation in opening pores, the Pnma
space group symmetry of the MFI framework
makes the subcell deformation of adjacent
straight channels symmetrical, and the changes
in Dmax and Dmin appear to alternately compensate
for each other (Fig. 4, F and G). Thus, the
particular symmetry of the MFI framework
gives it local flexibility and overall rigidity,
which allows subcell deformation to accommodate
larger molecules while still maintaining
the stability of the entire zeolite framework.
We term this structural characteristic of the
MFI zeolite subcell topological flexibility.


Summary 


Combining iDPC-STEM imaging with an at- 
mosphere system, we achieved in situ obser- 
vation of the dynamic subcell topological 
deformation of MFI channels with atomic 
resolution while the whole MFI zeolite single- 
crystal structure was maintained nearly un- 
changed. We revealed the intrinsic mechanism 
of a molecule breaking through the pore size 
limit. Upon benzene adsorption, the straight 
channels underwent severe local deformation 
along the unified orientations of guest mole- 
cules, which allowed the diffusion of larger 
molecules. 

This subcell topological flexibility mainly 
comes from the soft Si-O-Si hinges between 
rigid tetrahedral SiO4, as verified by AIMD simulations
and our experimental results. In the
in situ benzene desorption process, we cap- 
tured the dynamic evolution of zeolite channels 
with decreasing numbers of benzene mole- 
cules, indicating that the degree of subcell 
local deformation is positively correlated with 
the shape and amount of confined benzene 
molecules. Furthermore, the symmetry of the 
MFI framework enables mutual compensation 
by large deformation of adjacent channels, 
thereby keeping the overall crystal structure 
stable and resulting in macroscopically rigid 
behavior. As a consequence, the MFI framework
is highly applicable industrially within the
family of zeolites. These results confirmed the
microscopic subcell topological flexibility of a 
macroscopically rigid zeolite framework and 
revealed its chemical nature, advancing our 
understanding of the intrinsic mechanism of 
molecular diffusion breaking through the pore 
size limit and the impact of subcell topological 
flexibility on adsorption and catalytic trans- 
formation in microporous materials. 
Dynamics of CTCF- and cohesin-mediated chromatin
looping revealed by live-cell imaging 
Animal genomes are folded into loops and topologically associating domains (TADs) by CTCF and
loop-extruding cohesins, but the live dynamics of loop formation and stability remain unknown. Here, 
we directly visualized chromatin looping at the Fbn2 TAD in mouse embryonic stem cells using 
super-resolution live-cell imaging and quantified looping dynamics by Bayesian inference. Unexpectedly, 
the Fbn2 loop was both rare and dynamic, with a looped fraction of approximately 3 to 6.5% and a 
median loop lifetime of approximately 10 to 30 minutes. Our results establish that the Fbn2 TAD is 
highly dynamic, and about 92% of the time, cohesin-extruded loops exist within the TAD without 
bridging both CTCF boundaries. This suggests that single CTCF boundaries, rather than the fully 
CTCF-CTCF looped state, may be the primary regulators of functional interactions. 


ammalian genomes are folded into 
loops and domains known as topolog- 
ically associating domains (TADs) by 
the proteins CTCF and cohesin (J). 
Mechanistically, cohesin is thought 
to load on DNA and bidirectionally extrude 


loops until it is blocked by CTCF such that 
CTCF establishes TAD boundaries (2-6). Func- 
tionally, CTCF- and cohesin-mediated looping 
and TADs play critical roles in multiple nuclear 
processes, including regulation of gene expres- 
sion, somatic recombination, and DNA repair 


science.org SCIENCE 


RESEARCH | REPORTS 


A Miche: Cc tle PDE locus labeling B Tracking Fon2 loop dynamics using 3D distance E 3D distance distributions (PDF) 
7 = —3 
panel untagged © TetO ::TetR-3x-mScarlet © Rarer @ CTCF = 47x10 C27 (ATAD) 
ohesin : e — as 
© anchor3 :: EGFP-OR3 ° @ CTCFsite £ , | mean'= 267 nm 
, ‘S 2 C36 (WT) 
oF , m e j mean = 361 nm 
; ae GS SS% Bc C65 (ACTCFsites) 
® Ss éss = “~~ mean = 467 nm 
C36 = Fbn2 looping nf 25s 31 
2 dynamics? m. oN % 3s 
(WT) — 26S § © 
8 ° s % Las Sun a 0 
° 8 ol er 0 250 500 750 1000 1250 1500 
103 Higher 3D distance Lower 3D distance 3D distance, R [nm] 
—_ ae NG C Genome-edited cell lines Representative C36 cell with F Localization error corrected MSD 
a> ipa < i = 
335 L112 R41 CTCF motifs “wr labelled Fbn2 CTCF anchors oe 1_ ___ gs (ACTCFsites; n=147) 
£0 hi i RNAseq C36" Mreeramun eh ahah FIER UA go — C36 (WT; n=491) ae 
g : 5 kb ‘505 kb zB — C27 (ATAD; n=358) (== 
SO hit alana os aiblashldlbal ile SIC 2 Ss. 65 
5,13 C27 wou fome —"ATAD” § 10. t 
Oat acl iy 10 kb Es 
213 2§ 
20 Input “ACTCFsites 3 8 
3 Fon? CEE MOP REApRGreurtan an nuance Ta 
Co = 5» = = > Transcripts 5 kb 505 kb N B19? 
57.6 57.8 58.0 58.2 584 58.6 58.8 2 Reals a ee ween 
Genome position [Mb] 10° Time is] 10 
D Representative C36 trajectory tracking CTCF anchor dynamics 
= 
— 1 i} 
© i i 
oO 1 1 
i if 
£ 1 \ 
2 i 1 
a 1 i} 
Q it 
60 


Fig. 1. Endogenous labeling and tracking of the Fbn2 loop with super-resolution 
live-cell imaging. (A) Fluorescent labeling of Fbn2 loop anchors does not 
perturb the Fbn2 TAD. Shown is a Micro-C contact map comparing the parental 
untagged (C59, top left) and tagged (C36, bottom right) cell lines. Red triangles 
are CTCF motifs with orientation. C36 ChIP-seq shows CTCF (GSM3508478) 
and cohesin (SMC1A; GSM3508477) binding compared with input (GSM3508475). 
Fbn2 is not expressed (RNA-seq GSE123636; annotation: GRCm38). Genome 
coordinates: mm10. (B) Overview of tagging and readout using 3D distance. 


(1). For example, TADs are thought to regulate 
gene expression by increasing the frequency of 
enhancer-promoter interactions within a TAD 
and by decreasing enhancer-promoter inter- 
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actions between TADs (7). However, to under- 
stand how TADs and loops are formed and 
maintained and how they function, it is neces- 
sary to understand the dynamics of CTCF- 
and cohesin-mediated loop formation and 
loop lifetime. 

Although recent advances in single-cell 
genomics and fixed-cell imaging have made it 
possible to generate static snapshots of three- 
dimensional (3D) genome structures in single 
cells (8-13), live-cell imaging is required to 
understand the dynamics of chromatin looping
(14). Furthermore, previous studies have
yielded conflicting results as to whether loops
are well defined in single cells (8-13), perhaps
because of the difficulty associated with dis- 
tinguishing bona fide CTCF- and cohesin- 
mediated loops from mere proximity that 
emerges stochastically (14). Recent pioneering 
work has visualized enhancer-promoter inter- 
actions (15-17) and long-range V(D)J-chromatin 
interactions (18) in live cells. However, the 




(C) Overview of the genome-edited cell lines (left) and a representative 
maximum intensity projection (MIP) of a cell nucleus showing two pairs of “dots” 
(right). (D) Representative 3D trajectory over time of a dot pair. MIPs of the 3D 
voxels centered on the mScarlet dot (top) and 3D distances between dots 
(bottom) are shown. (E) 3D distance probability density functions of dot pairs 
(n = 32,171, n = 46,163, and n = 13,566 distance measurements for C27, C36, 
and C65, respectively). (F) Localization error—corrected two-point MSD plots 

(n = 358, n = 491, and n = 147 trajectories in C27, C36, and C65, respectively). 


dynamics of loop formation and the lifetime of 
CTCF- and cohesin-mediated loops have not 
yet been quantified in living cells. 

To visualize the dynamics of CTCF- and 
cohesin-mediated looping, we chose as our model 
system the loop holding together the two CTCF- 
bound boundaries of the 505-kb Fbn2 TAD in
mouse embryonic stem cells (mESCs). This 
TAD is verified to be CTCF dependent (19) and 
relatively simple because it only contains a 
single gene, Fbn2, which is not expressed in 
mESCs (Fig. 1A). We used genome editing to 
homozygously label the left and right CTCF 
sites of the Fbn2 TAD with TetO and Anchor3 
arrays, which we then visualized by coexpress- 
ing the fluorescently tagged binding proteins 
TetR-3x-mScarlet and EGFP-OR3 (20) [clone C36 
(Fig. 1, B to D)]. We developed a comprehensive 
image analysis framework, ConnectTheDots, to 
extract trajectories of 3D loop anchor positions 
from the acquired movies (fig. S1). By opti- 
mizing 3D super-resolution live-cell imaging 
Fig. 2. Degradation of CTCF, cohesin, and WAPL reveals their role in loop
extrusion and looping-mediated chromosome compaction. (A) Micro-C data
for the AID-tagged clones for RAD21 (left), CTCF (middle), and WAPL (right)
showing control data [no indole-3-acetic acid (IAA) treatment; top half] and
protein degradation data (3 hours after IAA; bottom half), with schematics
illustrating the expected effect. (B) Representative trajectories with (colored
lines) or without (gray lines) IAA treatment for each AID-tagged clone. (C) 3D
distance probability density functions of dot pairs [n = 45,379, n = 10,469,
and n = 18,153 distance measurements for ΔRAD21 (2 hours), ΔCTCF
(2 hours), and ΔWAPL (4 hours) depletion conditions, respectively, and
n = 17,605, n = 11,631, and n = 21,001 for the same clones without treatment].
(D) Localization error-corrected two-point MSD plots for the AID-tagged
conditions (4), we could track Fbn2 looping
dynamics at 20-s resolution for >2 hours
(Fig. 1D and movies S1 to S4). After DNA
replication in the S/G2 phase, it is no longer
possible to reliably distinguish intrachromosomal
from sister-chromosomal interactions
(14). We therefore filtered out replicated and
low-quality dots using a convolutional neural
network (fig. S2). Thus, we only considered G1
and early S-phase cells.

To validate our system for tracking Fbn2 
loop dynamics, we performed a series of con- 
trol experiments. First, we confirmed using 
Micro-C (21, 22) that our locus-labeling approach
did not measurably perturb the Fbn2
loop (Fig. 1A). Second, to “mimic” the looped
state, we deleted the 505 kb between the CTCF
sites, generating clone C27 (“ΔTAD”; Fig. 1C).
As expected, this reduced the 3D distance
(Fig. 1E; the nonzero 3D distance distribution
for C27 was expected because of localization
noise and the 5-kb tether between CTCF sites
and fluorescent labels; see fig. S3). Third, as a
negative control for CTCF-mediated looping,
we generated clone C65 (“ΔCTCFsites”; Fig. 1C)
by homozygously deleting the three CTCF
motifs in the Fbn2 TAD (L1, L2, R1; Fig. 1A)
and validated that this resulted in a loss of
CTCF binding and cohesin colocalization using
chromatin immunoprecipitation sequencing
[ChIP-seq (fig. S4)]. As expected, the 3D distance
increased in C65 (Fig. 1E). Next, we calculated
mean-squared displacements (MSDs)
of the relative positions of the two loci (two- 


clones (left) [n = 537, n = 137, and n = 215 trajectories in ΔRAD21 (2 hours),
ΔCTCF (2 hours), and ΔWAPL (4 hours) depletion conditions, respectively,
and n = 183, n = 151, and n = 257 for the same clones without treatment
(gray lines)]. The effective tether length was obtained by computing the ratio
of the steady-state variance of each clone to the value in the RAD21-depletion
condition (note that 2<R²> is also the asymptotic value of the MSD; see
also the supplementary materials). (E) Representative 3D polymer conformation
from simulations mimicking the ΔRAD21 (95% cohesin depletion)
(left) and ΔCTCF (100% CTCF depletion) (right) depletion conditions.
Red is the unextruded segment, blue the extruded segment. (F) Matching
simulations to the data to obtain loop-extrusion parameters (three fit
parameters; see the supplementary materials).



point MSD), which are unaffected by cell move- 
ment. Chromatin dynamics were consistent 
with Rouse polymer dynamics, with a scaling 
of MSD ~ t^0.5 for all three clones (23) (Fig. 1F).
We conclude that our approach faithfully re- 
ports on CTCF looping dynamics in live cells 
without noticeable artifacts. 
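
As a rough illustration of the two-point MSD described above, the following Python sketch computes a time-averaged MSD from the relative vector between two loci and estimates the scaling exponent with a log-log least-squares fit. The function names and the input format (one (x, y, z) tuple per frame) are assumptions made for illustration; they are not taken from ConnectTheDots, and the localization-error correction used for Fig. 1F is omitted.

```python
import math

def relative_positions(locus_a, locus_b):
    """Per-frame vector from locus A to locus B.

    Any displacement common to both loci (e.g., cell movement)
    cancels in this difference.
    """
    return [tuple(b - a for a, b in zip(pa, pb)) for pa, pb in zip(locus_a, locus_b)]

def two_point_msd(rel, max_lag):
    """Time-averaged MSD of the relative vector for lags 1..max_lag.

    Requires max_lag < len(rel) so every lag has at least one pair.
    """
    msd = {}
    for lag in range(1, max_lag + 1):
        squared = [
            sum((q - p) ** 2 for p, q in zip(rel[i], rel[i + lag]))
            for i in range(len(rel) - lag)
        ]
        msd[lag] = sum(squared) / len(squared)
    return msd

def scaling_exponent(msd):
    """Least-squares slope of log(MSD) versus log(lag)."""
    xs = [math.log(lag) for lag in msd]
    ys = [math.log(value) for value in msd.values()]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sum((x - mx) ** 2 for x in xs)
    return num / den

# Hypothetical MSD values that follow a square-root law in the lag.
example_msd = {1: 1.0, 4: 2.0, 16: 4.0}
print(scaling_exponent(example_msd))
```

An exponent near 0.5 from such a fit is what the text reports as consistent with Rouse dynamics; on real trajectories the fit range and noise correction would need more care than this sketch takes.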

To elucidate the specific roles of CTCF and 
cohesin, we endogenously tagged RAD21, 
CTCF, and the cohesin unloader WAPL with 
mAID in the C36 (WT) line, allowing for deg- 
radation with indole-3-acetic acid (24). For 
RAD21 and CTCF, we achieved near-complete 
depletion in 2 hours (fig. S5), long-term deple- 
tion led to cell death (fig. S6), Micro-C analysis 
revealed loss of the Fbn2 loop or corner peak 
as expected (25-28) (Fig. 2A), and ChIP-Seq 
Fig. 3. BILD reveals rare and dynamic CTCF loops. (A) Example trajectory
from polymer simulations with loop extrusion. Extrusion shortens the effective
tether (red is the unextruded length, ground truth from simulations) between
the CTCF sites. A ground truth loop is formed when the tether is minimal and
cohesin is stalled at both CTCF sites (black bar). BILD captures these accurately
(purple bar). (B) Schematic overview of BILD. Building on the analytical solution
to the Rouse model, we used a hierarchical Bayesian model to determine the
optimal looping profile for single trajectories. (C) Illustrative examples of inferred
looping on real trajectory data. (D) Kaplan-Meier survival curves rescaled by the
inferred looped fraction. Gray lines are maximum likelihood fits of a single
exponential to the data, accounting for censoring. (E) Fraction of time the
Fbn2 locus spends in the fully looped conformation. Error bars are bootstrapped
95% confidence intervals. (F) Median loop lifetimes from the Kaplan-Meier
survival curves (squares) or exponential fits (crosses). Confidence intervals are
determined from the confidence intervals on the Kaplan-Meier curve and the
likelihood function of the exponential fit, respectively. Where the upper
confidence limit on the survival curve did not cross below 50%, an arrowhead
indicates a semi-infinite confidence interval.


analysis showed a loss of chromatin binding 
(fig. S7). For WAPL, depletion took 4 hours 
and was less complete (fig. S5), long-term
depletion occasionally yielded visibly com- 
pacted “vermicelli” chromosomes (29) (fig. 
S6C), and Micro-C analysis revealed increased 
corner peak strength (27, 28, 30) (Fig. 2A). All 
three AID lines exhibited lower protein abun- 
dance, likely because of leaky protein deple- 
tion (fig. S8). 

Having validated the AID cell lines, we next 
performed live-cell imaging to study the specific 
roles of RAD21, CTCF, and WAPL in loop 
extrusion in vivo. Consistent with RAD21 being 
required for loop extrusion, RAD21 depletion 
strongly increased the 3D distances (Fig. 2, 
B and C). Consistent with CTCF being the 
boundary factor that is required for Fbn2 loop 
formation (Fig. 1B) but not required for loop 
extrusion, CTCF depletion increased 3D dis- 
tances, albeit less than RAD21 depletion (6). 
Finally, consistent with prior observations that 


SCIENCE science.org 


val curves rescaled by the 


WAPL depletion increases cohesin residence 
time and abundance on chromatin (29), poten- 
tially allowing it to extrude longer and more 
stable loops (27, 30), WAPL depletion decreased 
the 3D distances (Fig. 2, B and C). 

To quantify the extent of loop extrusion of 
the Fbn2 TAD, we turned to polymer physics 
theory. The Rouse model predicts a linear 
relationship between chain length and mean 
squared distance (<R²>) between the fluorescent
labels (dashed lines in Fig. 2D; see also
fig. S9). By assuming that ΔRAD21 represents
the fully unextruded state with a genomic
separation of 515 kb (Fig. 1C), we could then
assign an “effective tether length” (i.e., the
unextruded fraction) to each condition. We
found an effective tether length of ~200 kb in
C36 (WT) and ~280 kb in ΔCTCF, corresponding
to ~39 and ~54% of the full genomic separation,
respectively. By subtraction, the genomic
separation between the two labels shortened
by ~46% due to extrusion alone (ΔRAD21
versus ΔCTCF) and by ~61% due to extrusion
with boundaries (ΔRAD21 versus C36). This
shows that by blocking extruding cohesins, 
CTCF increases the fraction extruded between 
the two CTCF boundaries. Overall, we estimate 
that, on average, just over half of the Fbn2 TAD 
is extruded. 
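
The percentages in this paragraph follow from simple ratios, which the short Python sketch below reproduces. It assumes only the values stated in the text (a 515-kb reference separation and tether lengths of ~200 and ~280 kb); the function names are hypothetical and the linear Rouse relationship between <R²> and chain length is taken at face value.

```python
FULL_SEPARATION_KB = 515.0  # label separation in the fully unextruded reference (from the text)

def effective_tether_kb(mean_sq_dist, mean_sq_dist_reference):
    """Rescale the reference length by the ratio of <R^2> values.

    Under the Rouse model <R^2> is linear in chain length, so the
    ratio to the RAD21-depleted reference gives the unextruded length.
    """
    return FULL_SEPARATION_KB * mean_sq_dist / mean_sq_dist_reference

def extruded_fraction(tether_kb):
    """Fraction of the 515-kb separation removed by extrusion."""
    return 1.0 - tether_kb / FULL_SEPARATION_KB

# Tether lengths quoted in the text (approximate values).
for label, tether_kb in [("C36 (WT)", 200.0), ("CTCF depleted", 280.0)]:
    unextruded_pct = round(100 * tether_kb / FULL_SEPARATION_KB)
    extruded_pct = round(100 * extruded_fraction(tether_kb))
    print(label, unextruded_pct, extruded_pct)
```

With the rounded tether lengths from the text, this gives 39% unextruded and 61% extruded for C36, and 54% and 46% for the CTCF-depleted condition.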

By combining these measurements (Fig. 2, B 
to D) with our Micro-C data (Fig. 2A), we were 
then able to determine dynamic parameters 
of our polymer model of loop extrusion (Fig. 2, 
E and F), including spacing between cohesins 
and their processivity, as well as the total 
strength of the CTCF boundaries (see the 
supplementary materials). Consistent with 
our ΔRAD21 data, our polymer simulations
resulted in chromosome decompaction after 
near-complete RAD21 depletion (Fig. 2E) and 
accurately matched our experimental data 
(Fig. 2F). 

Next, we sought to identify where and when
CTCF-CTCF loops occurred in our trajectories.
Fig. 4. Comprehensive picture of the Fbn2 TAD. (A) Comparison of Micro-C
data for the C36 (WT) line with in silico Micro-C of our best-fit simulation
map (left) and contact probability scaling (right). (B) BILD applied to the same
simulation (green) compared with C36 (WT) experimental data (blue). (C) Number
of cohesins forming the looped state in simulations (n = 18,789). (D) “Anatomy” of
the Fbn2 TAD. Quantitative description of the Fbn2 TAD is shown using both real


Because of localization noise and substantial 
temporal correlations in the data, simple 
analysis methods failed when benchmarked 
on simulations (see the supplementary text). 
Therefore, we developed Bayesian inference 
of looping dynamics (BILD). In BILD, we 
coarse-grained the possible conformations of 
the TAD (Fig. 3A) into two states: (i) a state of 
sustained contact between the CTCF sites, 
presumably mediated by cohesin (the “looped 
state”), and (ii) all other possible conforma- 
tions, including partial extrusion, random 
contacts, and the fully unlooped conforma- 
tion (the “unlooped state”). While the looped 
state relies on CTCF activity, the unlooped 
state reflects extrusion without bridged CTCF 
boundaries, resembling the ΔCTCF condition.
On the basis of the MSD ~ t^0.5 scaling observed
in Fig. 2D, we modeled the unlooped state as
a free Rouse chain calibrated to the ΔCTCF
data (fig. S10). To model the looped state, we
introduced an additional bond between the
two CTCF sites (Fig. 3B); this bond is switchable,
allowing transitions between the looped
and unlooped states. The length of the bond
was set to reproduce the 10-kb distance between
the fluorophores, using ΔRAD21 as
reference for a free 515-kb chain (see the
supplementary text). Finally, by using a hierarchical
Bayesian model (31), BILD then exploited
the different spatiotemporal dynamics of the
looped state to infer which segments of each
trajectory were in the looped state (purple
segment in Fig. 3A). When tested on 3D
polymer simulations with experimentally
realistic noise, BILD accurately inferred both 
the looped fraction and the loop lifetime 
(Fig. 2, E and F, and figs. S11 and S12). In 
summary, BILD allows us to distinguish 
CTCF- and cohesin-mediated looping from 
mere proximity. 

We next used BILD to infer looping in our 
experimental trajectory data (Fig. 3, C to F). 
BILD revealed that the Fbn2 TAD was fully 
looped ~6.5% (~3%) of the time, but spent 
~93.5% (97%) of the time in a fully unlooped 
or partially extruded conformation (Fig. 3E). 
We use brackets to indicate the looped frac- 
tion after false-positive correction (fig. S12; 
the corrected looped fraction was ~6% when 
we calibrated BILD using a 15-kb fluorophore 
distance; fig. S13). By contrast, we observed 
a minimal looped fraction of ~2% (~0%) in 
ΔRAD21 and ΔCTCF and ~4% (~1%) in C65
(ΔCTCFsites), whereas the looped fraction
was increased to ~10% (~6%) in ΔWAPL, consistent
with WAPL unloading cohesin from
chromatin (29). 

Finally, we estimated the lifetime of the 
looped state (Fig. 3, D and F). Accurate mea- 
surement of loop lifetimes from finite trajecto- 
ries can be challenging when trajectories begin 
or end in the looped state, so that it is unclear 
how long the looped period truly lasted (e.g., 
the looped state in the ΔWAPL trajectory in
Fig. 3D existed an unknown time before the
start of the movie). This problem, known in
medical statistics as “censoring,” can be solved
using the Kaplan-Meier survival estimator.

data (blue) and our best-fit simulation (green). Cohesin processivity and density and
CTCF stalling probability and lifetime boost are simulation parameters. Fraction of
time in different conformations was extracted from simulation ground truth using
effective tether lengths of 1.1 and 505 kb as cutoffs to define “fully looped” and “fully

Using this approach, we obtained censoring- 
corrected survival curves (Fig. 3D) of the looped 
state, from which we estimated the median 
loop lifetime (Fig. 3F). Orthogonal to this non- 
parametric analysis, we also fitted an exponen- 
tial model, yielding similar estimates. Together, 
these results give an estimate of the median 
loop lifetime of ~10 to 30 min in C36 (WT) 
(Fig. 3F and fig. S12D). These results revealed 
the fully looped state to be both rare (~3 to 6%) 
and dynamic (median ~10 to 30 min; mean 
~15 to 45 min). Thus, during an average ~12-hour 
mESC cell cycle, the looped state will occur 
approximately one or two times, lasting cumu- 
latively ~20 to 45 min, but the remaining 
~11.5 hours will be in the partially extruded 
or fully unlooped conformations. 
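
To make the censoring correction concrete, here is a minimal Python sketch of the Kaplan-Meier product-limit estimator applied to looped-interval durations. The data are invented for illustration, the function names are hypothetical, and intervals cut off at either end of a movie are all treated as right-censored, which is a simplifying assumption rather than a description of the analysis actually used.

```python
def kaplan_meier(durations, observed):
    """Return [(t, S(t))] at each distinct time with an observed loop end.

    durations: length of each looped interval (e.g., in minutes)
    observed:  True if the interval's end was seen, False if censored
    """
    event_times = sorted({t for t, seen in zip(durations, observed) if seen})
    survival, curve = 1.0, []
    for t in event_times:
        # Intervals still looped just before t, including those censored at t.
        at_risk = sum(1 for d in durations if d >= t)
        events = sum(1 for d, seen in zip(durations, observed) if seen and d == t)
        survival *= 1.0 - events / at_risk
        curve.append((t, survival))
    return curve

def median_lifetime(curve):
    """First time at which S(t) is at or below 0.5; None if never reached."""
    for t, s in curve:
        if s <= 0.5:
            return t
    return None

# Invented durations in minutes; False marks intervals cut off by the movie.
durations = [5, 10, 10, 20, 30, 40]
observed = [True, True, False, True, False, True]
curve = kaplan_meier(durations, observed)
print(curve)
print(median_lifetime(curve))
```

Censored intervals never count as loop endings, but they stay in the at-risk set until their recorded duration, which is how the estimator avoids biasing lifetimes downward. When the curve never drops to 50%, the median is undefined, matching the semi-infinite confidence intervals mentioned in the Fig. 3 caption.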

To understand whether a low looped frac- 
tion of ~3% is consistent with a clear and 
strong corner peak in the Micro-C map, we 
set up polymer simulations with loop extru- 
sion. Consistent with recent reports, we 
found that CTCF-mediated stabilization of 
cohesin was necessary to reproduce both of 
these features in our simulations (Fig. 4 
and fig. S14). We confirmed this effect using
inverse fluorescence recovery after photo- 
bleaching (iFRAP) of cohesin, finding that 
CTCF depletion decreased cohesin residence 
time (fig. S15). Incorporating this effect, we 
then simulated loop extrusion with a cohe- 
sin density of 1/240 kb and processivity of 
150 kb (processivity = lifetime × extrusion
speed). When cohesin reaches a CTCF site, it
has a 12.5% probability of stalling, which,
has a 12.5% probability of stalling, which, 
using the estimate of 50% CTCF occupancy 
(34), translates into an ~25% capture effi- 
ciency of CTCF. Once stalled on one side by 
CTCF, cohesin is stabilized fourfold beyond 
its base lifetime of ~20 min (35) (fig. S14), 
facilitating the formation of longer loops 
because the other side of cohesin may con- 
tinue to extrude. These simulations repro- 
duced both our experimental Micro-C maps 
(Fig. 4A) and the median loop lifetime and 
low looped fraction (Fig. 4B). 
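
The simulation parameters quoted above are linked by simple arithmetic, sketched below in Python. The function names are hypothetical. The implied extrusion speed assumes that the 150-kb processivity pairs with the ~20-min base lifetime, which the text suggests but does not state outright, so that number should be read as an inference.

```python
def capture_efficiency(stall_probability, ctcf_occupancy):
    """Stalling probability given that the CTCF site is actually bound."""
    return stall_probability / ctcf_occupancy

def extrusion_speed_kb_per_min(processivity_kb, lifetime_min):
    """Rearranged from processivity = lifetime x extrusion speed."""
    return processivity_kb / lifetime_min

def mean_cohesins(region_kb, spacing_kb):
    """Average number of cohesins on a region at the given mean spacing."""
    return region_kb / spacing_kb

# Values quoted in the text.
print(capture_efficiency(0.125, 0.50))          # fraction of bound-site encounters that stall
print(extrusion_speed_kb_per_min(150.0, 20.0))  # kb per minute (inferred, see lead-in)
print(4 * 20.0)                                 # stabilized lifetime in minutes (fourfold boost)
print(round(mean_cohesins(505.0, 240.0), 1))    # cohesins per Fbn2 TAD on average
```

An average of about two cohesins across the 505-kb region is in line with the one to three extruding cohesin loops described for the partially extruded state.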

Together, these results allow us to paint a 
comprehensive mechanistic picture of the 
Fbn2 TAD (Fig. 4, C and D). Most of the time 
(~92%), the TAD is partially extruded, with 
~57 to 61% of the Fbn2 region captured in 
one to three extruding cohesin loops, whereas 
~39 to 43% remain unextruded. The fully 
unlooped conformation, as would be found 
in the absence of cohesin, occurs only ~6% of 
the time, whereas the fully looped state is 
even more rare at ~3% (~2% in simulations) 
and has a median lifetime of ~10 to 30 min. 
Our simulations reveal that the looped state is 
sometimes held together by multiple cohesins 
(Fig. 4C), which also explains why the loop 
lifetime can be substantially shorter than 
the CTCF-stabilized cohesin lifetime. We stress 
that both the mechanistic assumptions of 
our polymer models and the experimental 
data constraining them are associated with 
uncertainty, resulting in uncertainty of the 
inferred parameters (Fig. 4D). For example, 
if we allow extruding cohesins to bypass each 
other in our simulations (36, 37), then our 
estimates of the fold stabilization of cohesin by 
CTCF would change from approximately four- 
fold to approximately twofold, the CTCF stall- 
ing probability would change from 12.5 to 
25%, and the looped state would now be held 
together by a single cohesin (fig. S16). We also 
note that TADs smaller than the 505-kb Fbn2
TAD and TADs with stronger CTCF bounda- 
ries may have a higher looped fraction (38). 
Accordingly, we propose that our absolute 
quantification of the Fbn2 looped fraction may 
now allow calibrated inference of absolute 
looped fractions genome wide on the basis of 
Micro-C (13).

Our findings reveal that the CTCF- and 
cohesin-mediated looped state that holds to- 
gether CTCF boundaries of TADs is rare, dynam- 
ic, and transient. A key limitation of our 
study is that it represents just one loop in 
one cell type. Nevertheless, the Fbn2 loop is 
among the strongest quartile of “corner peaks” 
in Micro-C maps, suggesting that most other 
similarly sized loops in mESCs are likely 
weaker than Fbn2 (fig. S17). Our results thus 
rule out static models of TADs that exist in 
either a fully unlooped state or a fully looped 
state stably bridged by one cohesin (Fig. 1B). 
Instead, we show that the Fbn2 TAD most
often exists in a partially extruded state formed 
by a few cohesins in live cells (~92%; Fig. 4D), 
and that when the rare looped state is formed, 
it is transient (~10 to 30 min median lifetime; 
Fig. 4B). Because the partially extruded state 
dominates, this may be the functionally im- 
portant TAD state. Thus, we suggest that CTCF- 
mediated transcriptional insulation may be 
mediated more by individual extrusion-blocking 
CTCF boundaries than by the rare fully looped 
state. Similarly, this suggests that regulatory 
interactions, such as those between enhancers 
and promoters, may depend more on frequent 
cohesin-mediated contacts within a TAD rather 
than rare CTCF-CTCF loops. This dynamic pic- 
ture of TADs in live cells (Fig. 4D) may also help 
to explain cell-to-cell variation in 3D genome 
structure, and consequently stochasticity in 
downstream processes such as gene expres- 
sion and cell differentiation. 
HYDROGELS

Piezoionic mechanoreceptors: Force-induced current generation in hydrogels

Yuta Dobashi, Dickson Yao, Yael Petel, Tan Ngoc Nguyen, Mirza Saquib Sarwar,
Yacine Thabet, Cliff L. W. Ng, Ettore Scabeni Glitz, Giao Tran Minh Nguyen, Cédric Plesse,

The human somatosensory network relies on ionic currents to sense, transmit, and process tactile information.
We investigate hydrogels that similarly transduce pressure into ionic currents, forming a piezoionic skin. 

As in rapid- and slow-adapting mechanoreceptors, piezoionic currents can vary widely in duration, from 
milliseconds to hundreds of seconds. These currents are shown to elicit direct neuromodulation and muscle 
excitation, suggesting a path toward bionic sensory interfaces. The signal magnitude and duration depend on 
cationic and anionic mobility differences. Patterned hydrogel films with gradients of fixed charge provide 
voltage offsets akin to cell potentials. The combined effects enable the creation of self-powered and ultrasoft 
piezoionic mechanoreceptors that generate a charge density four to six orders of magnitude higher than 


those of triboelectric and piezoelectric devices. 


Human touch perception relies on soft
cutaneous mechanoreceptors that intricately
interface with even softer neural
tissues. In advanced prosthetics, robotics,
and wearable devices, the pursuit
of sensor arrays that mimic human skin has
led to elastomer-based piezoresistive (1),
capacitive (2), and electret (3) sensors and
electronic skins (4-6). These sensors enable soft
interactions and can overcome the device-to- 
tissue mechanical mismatch that is inherent 
to conventional electronic devices. Ionic skin 
presents an alternative to electronic skin, 
producing ultrastretchable, tough, and trans- 
parent devices, including capacitive sensors, 
from hydrogels (7-9). In this study, we used 
hydrogels to directly produce ionic currents in 
response to pressure gradients—a piezoionic 
approach to sensing. This method comple- 
ments other recent iontronic approaches such 
as the use of supercapacitive gates and highly 
conductive hydrogel-conducting polymer hy- 
brids, enabling the large ionic charge ex- 
change at the low voltages needed for neural 
interfaces (10, 11). Piezoionics are shown to 
offer similarly high-charge density and low- 
voltage operation. Like their biological coun- 
terparts, they rely on ionic currents. Unlike 
mechanoreceptors, in which a transmembrane 
Donnan potential is released by mechanical 
gating (Fig. 1A), piezoionic sensors operate by 
pressure-driven ion flux (Fig. 1B). This occurs
either by streaming a single polarity of ions 
or by one polarity of ions being preferentially 
carried over the counterions through the poly- 
mer matrix, creating a net charge imbalance. 

Streaming and other pressure-driven voltage-
generation methods have been documented
in ion conductors such as Nafion (12), polypyrrole
(13), cartilage (14), and hydrogels
(15-17). Upon deformation of such piezoionic
matrices, the open-circuit voltage gradient,
∇V_sense, is proportional to the applied pressure
gradient, ∇P (Fig. 2A). In the simplest
case, the relationship can be described as
∇V_sense = α∇P, where α is the piezoionic coefficient
(see full discussion in the supplementary
materials). α ranges from 0.01 to 100 nV/Pa,
depending on the ionic species and matrix
properties (table S2). It is equivalent to the
ratio L₁₂/σ proposed by de Gennes et al. (12)
to describe field generation in ionomeric
polymers exposed to a pressure gradient,
where L₁₂ represents the coupling between
the applied pressure and the induced ionic
current, and σ is the ionic conductivity.
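
For a sense of scale, the linear relationship above can be evaluated directly. The Python sketch below assumes a spatially uniform α, so that integrating the gradient relation between the deformed and undeformed regions gives a voltage difference of α times the pressure difference. The function name is hypothetical, and the 20-kPa pressure is chosen to match the light step compression used in the indentation experiments.

```python
def sense_voltage_mV(alpha_nV_per_Pa, delta_p_Pa):
    """Open-circuit voltage difference for a uniform piezoionic coefficient.

    alpha is given in nV/Pa and the pressure difference in Pa;
    the factor 1e-6 converts nV to mV.
    """
    return alpha_nV_per_Pa * delta_p_Pa * 1e-6

# Lower and upper ends of the alpha range quoted in the text, at 20 kPa.
for alpha in (0.01, 100.0):
    print(alpha, sense_voltage_mV(alpha, 20_000.0))
```

At the upper end of the quoted range this gives about 2 mV, the same order as the several millivolts reported for the acrylic acid copolymer gels; at the lower end the signal is a fraction of a microvolt.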

To explore the molecular origins of this ef- 
fect and investigate applications in sensing, 
we designed an indentation experiment in 
which the open-circuit voltage or short-circuit 
current was measured between the deformed 
and undeformed portions of the gels (Fig. 2A 
and fig. S1). Indentation is analogous to the 
effect of a finger or other object pressing into 
skin. Two voltage responses to a very light 
step compression of 20 kPa are shown in Fig. 
2B for acrylamide (AAm)-based charged co- 
polymers that contain acrylic acid (AA) and 
proton counterions. This ultra-absorbent and 
biocompatible hydrogel, which is ubiquitous 
in commercial uses, generates several milli- 
volts as a result of streaming potentials (15). 
Transient response time depends on polymer 
content, with the lower polymer content (15% 


by weight, black curve) producing a faster rise 
(1 s) and decay (15 s) than the higher polymer 
content version (30%, blue curve), which ex- 
hibits a slow rise (15 s) and little decay. The 
halving of the polymer content changes com- 
pressibility, leading to much larger relative 
stress relaxation (fig. S5A). This suggests that 
the faster response in the lower-content hy- 
drogel is due to a greater permeability of the 
hydrogel matrix, increasing within-pore flow 
that carries protons and, in turn, producing a 
faster rise. The decay in response is also faster 
as the flow completes sooner. 

A Poisson-Nernst-Planck model was com- 
bined with poroelastic mechanics (see sup- 
plementary materials) to help interpret the 
transient behavior (purple and red model-fit 
curves in Fig. 2B). The model indicates that 
the application of a pressure gradient creates 
an initial ionic current, driven by the internal 
convection of water and protons within the 
pores. This movement of charge creates con- 
centration and voltage gradients that, in turn, 
lead to currents that oppose the convection- 
driven currents (Fig. 2, C and D; movies S8 to 
S10; and fig. S19). Peak voltage is reached 
when the back current, composed of diffu- 
sive and electrophoretic fluxes, balances the 
convective current. When electrolyte flow 
subsides, the transient separation of charge 
decays. The decay is much slower in the den- 
ser polymer (Fig. 2B), owing to the higher 
flow resistance and longer poroelastic time 
constant. This time constant is given by
$\tau = \frac{(1 - 2\nu)\,\eta L^{2}}{2(1 - \nu)\,G\kappa}$, where η is the fluid viscosity
within the pores of the hydrogel, L is the
indenter size or characteristic length scale
of deformation, G is the shear modulus, ν is
Poisson’s ratio, and κ is the matrix permeability.
G increases and κ decreases with
increasing polymer content, with both being
very strong functions of polymer volume frac- 
tion. Overall, the time constant of flow gets 
longer as polymer content increases. The 
simulation fits with the expectation that dif- 
ferences in matrix permeability and stiffness 
account for variations in rise and decay be- 
haviors (model response in Fig. 2B, dashed 
lines). This understanding can be exploited to 
create a family of decaying or nearly steady 
responses of either positive or negative polarity,
as simulated in fig. S25. The polarity
is reversed by changing the sign of the mo- 
bile ion—for example, by replacing AA with 
[2-(methacryloyloxy)ethyl]trimethylammonium 
chloride (MAETAC) (fig. S4). A rich set of 
voltage responses is obtained as displace- 
ments and forces are increased (figs. S3, S4, 
and S7), with voltage amplitude increasing 
and transient response time changing. The
reasons for the force dependence of the decay
are not yet fully understood; identifying them will
require careful characterization of the changes
in permeability and viscoelastic response as a
function of deformation, as discussed in the
supplementary materials.
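
The dependence of the poroelastic time constant on matrix properties can be explored numerically. The Python sketch below assumes the standard linear poroelastic form, τ = (1 − 2ν)ηL² / [2(1 − ν)Gκ], using the symbols defined in the paragraph above. All numerical values are hypothetical placeholders chosen only to show the trend; they are not measurements from this study.

```python
def poroelastic_time_constant(eta, length, shear_modulus, poisson, permeability):
    """tau = (1 - 2*nu) * eta * L^2 / (2 * (1 - nu) * G * kappa), in SI units."""
    return ((1.0 - 2.0 * poisson) * eta * length ** 2) / (
        2.0 * (1.0 - poisson) * shear_modulus * permeability
    )

# Hypothetical "low polymer content" gel: water-like viscosity, 1-mm indenter.
tau_low = poroelastic_time_constant(
    eta=1e-3, length=1e-3, shear_modulus=1e4, poisson=0.3, permeability=1e-15
)

# Hypothetical "high polymer content" gel: stiffer (G x 4), far less permeable (kappa / 100).
tau_high = poroelastic_time_constant(
    eta=1e-3, length=1e-3, shear_modulus=4e4, poisson=0.3, permeability=1e-17
)

print(tau_low, tau_high, tau_high / tau_low)
```

Because G and κ enter as a product, a higher modulus shortens τ while a lower permeability lengthens it. The time constant grows with polymer content only if the permeability falls faster than the modulus rises, which is the regime the text describes.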

The streaming potentials that generate the 
responses plotted in Fig. 2B are reliant on 
having a single mobile charge within the elec- 
trolyte, balanced by a fixed charge of opposite 
sign on the polymer matrix. However, recent 
results (18-22) show that voltages are gener- 
ated even in the absence of fixed charge. This 
effect is confirmed in an uncharged poly- 
acrylamide (pAAm) hydrogel swollen with 0 
to 1.5 M aqueous sodium chloride, whose peak 
sense voltage and ionic conductivity both mo- 
notonically increase as a function of concentra- 
tion (Fig. 2E). No model has been developed 
to explain this response. The most straight- 
forward conclusion is that the chloride ions 
displace faster than the sodium ions under 
indentation. This could be due to several ef- 
fects: filtration resulting from size differences 
between ions (23), differential affinity to the 
matrix, or fluidic hindrance effects in which 
ion size is comparable to boundary layer di- 
mension (24). Given that pore sizes in the hy- 
drogels are within a factor of 10 of solvated 
ion sizes, and that filtration of potassium and 
chloride is evident even in pAAm hydrogels 
with low polymer content (23), we hypothe- 
sized that the physical barrier to diffusion 
provided by the polymer matrix is a factor that 
influences the generation of net current. 

To observe the relationship between ionic 
mobilities and piezoionic response, experi- 
ments were conducted in poly(vinylidene 
fluoride-co-hexafluoropropylene) (PVDF-HFP) 
solid polymer electrolyte swollen with 0.1 to 
3.0 M lithium bis(trifluoromethanesulfonyl) 
imide (LiTFSI) in propylene carbonate (PC).
The salt was chosen because the lithium and 
fluorine enable diffusion coefficients to be 
readily extracted by nuclear magnetic reso- 
nance (NMR) (see supplementary materials). 
As salt concentration increases, there is also 
a reduction in the solvation shell and hydro- 
dynamic radius of the lithium ion, from a 
coordination number of four in its first shell 
at dilute concentrations down to two PC mol- 
ecules at high concentrations (25). This re- 
duction is apparent in the increase in lithium 
diffusion coefficient with concentration, and 
relative to TFSI⁻, as shown in Fig. 2G. The reduction
occurs despite increasing solution viscosity
(fig. S17). The diffusion coefficients of
the two ions cross over, with TFSI⁻ dominant
at low concentrations and Li⁺ at high concentrations.
There is a corresponding polarity
shift in the piezoionic voltage. The amplitude 
and sign of the sensor response to a sinusoidal 
indentation (0.1 Hz, 5% compressive strain) 
are shown in Fig. 2H (based on data plotted in 
fig. S13, A to H). The voltage is positive at low
concentrations but crosses over to negative
polarity between 1.5 and 2.0 M, the same
range in which diffusion coefficients cross
over. The apparent match, within experimental
tal uncertainty, between the diffusion cross- 
over point and the zero-sensing concentration 
is notable and supports the theory that the 
more-mobile and less-hindered ions deter- 
mine the sign of the voltage response. 
This differential effect is not explained by 
Darcy’s continuum flow theory, which as- 
sumes that all molecules are carried at the 
same speed as the solvent. Given that the flow 
within the hydrogels is in a transition regime, 
it should not be surprising that interaction 
with the backbone leads to differential ion 
transport rates and currents. We propose a 
mechanistic picture of piezoionic electrome- 
chanical response in hydrogels with two mobile 
ions, where the cations and anions do not 
move at the same speed as the solvent, leading 
to current and voltage generation. The mobile 
ions are subject to a hydrodynamic drag force 
from the convection of electrolyte within the 
pores. This is balanced by interactions with 
the stationary polymer that hinder transport. 
If the effective mobilities of the ions within the
matrix, as inferred from the diffusion coefficients $D_{+}$
for cations and $D_{-}$ for anions, are the same as
they would be without the additional drag due
to the polymer, as given by $D_{o+}$ and $D_{o-}$, then
we expect that no voltage would be generated,
as both ions are carried at the speed of the
fluid, $v_f$. Additionally, if cations and anions are
equally hindered by the polymer, we expect
that no sense voltage would be generated.
These expectations lead to a predicted piezoionic
coefficient that is proportional to the
difference in the two diffusion coefficients,

$$\alpha \propto \frac{e N \kappa}{\sigma \eta}\left(\frac{D_{+}}{D_{o+}} - \frac{D_{-}}{D_{o-}}\right)$$

(see the supplementary materials for derivation).
Here, $e$ is the electronic charge (in coulombs),
$N$ is the concentration (per cubic meter), $\kappa$ is the matrix
permeability (in square meters), $\eta$ is the
fluid viscosity (in pascal seconds), and $\sigma$ is the ionic conductivity. The relationship
suggests that when the diffusion coefficients
of the two ions are approximately
equal, the voltage generation will be zero (Fig.
2, G and H). It also suggests that by maximizing
the difference in ion size (and thereby
drag due to the matrix), a larger sense voltage
will be obtained. The use of bulk solution
values for $D_{o+}$ and $D_{o-}$ is a starting point but
does not provide a quantitative match between
experiment and theory because the
polymer affects the interactions between ions
Fig. 1. Schematic representation comparing biological sensory transduction and piezoionics, and
conceptual depiction of a piezoionic sensing device as an iontronic neuroprosthetic. (A) Mechanoreceptors
have anchors that attach the membrane to the extracellular matrix as well as a cytoskeleton that both stretch
an ion channel, causing an influx of sodium ions upon deformation. Once the cell membrane potential increases,
voltage-sensitive ion channels (not shown) open to allow reversal to resting potential. The rapid- and slow-adapting
mechanoreceptors differ, in part, in the mechanical structures of the cell. (B) Piezoionic skin assembly has a
built-in potential difference set by the difference in fixed charge concentration between poly(acrylic acid) (polyAA;
charged) and polyacrylamide (pAAm; neutral). Compression of the charged side creates a flux of solvent and
protons that increases this potential difference.
and solvent. This is observed in the ionic conductivity,
which is higher in the polymer gel than in
the bulk electrolyte at high
salt concentrations (>1.0 M), peaking near the
sensor crossover point (Fig. 2F), and the diffusion
coefficients in bulk solution correspondingly
drop below those in the matrix, crossing
over at 1 M (fig. S14). The polymer matrix
reduces ion-ion and ion-solvent interactions
(26, 27). The lower viscosity reduces drag,
whereas effective ion concentration is increased
by reducing ion pairing, hence increasing
overall conductivity. This observation means
that we cannot readily measure the hydro- 
dynamic component of drag on ions within 
Fig. 2. Piezoionic mechanisms and electrical responses. (A) Schematic of
a polymer gel under indentation, exhibiting differential ionic displacement and
field, with smaller red cations being carried through the green polymer chain
network faster than the blue anions, generating a charge imbalance and electric
field. In the indentation experiments, a working (sense) electrode is placed
under the indented portion, whereas a reference (ground) electrode is placed
in the undeformed portion of the gel. In this case, a preferential displacement
of the cations over anions results in a negative voltage reading. (B) Poly(acrylic
acid-co-acrylamide) hydrogel voltage response upon step compression at
20 kPa. Transient behavior slows with increasing polymer content (15 and
30% w/v). Dashed fit curves were generated using a finite element simulation.
(C) Simulation of the contributing components of ionic current at open circuit
during the 15% poly(AA) gel compression, and (D) simulated breakdown
of the ionic transport during the 30% poly(AA) gel compression, showing that
convection dies off faster in the gel with lower polymer content. (E) Ionic
conductivity of 15% w/v pAAm gel swollen with NaCl of varying concentrations
and peak generated voltages (black; data are mean ± SD, N = 3) under
10% sinusoidal compression at 0.1 Hz, showing increased signal at higher
concentrations (red; data are mean ± SD, N = 3). (F) Ionic conductivity of
the PVDF-HFP solid polymer electrolytes (blue) and bulk LiTFSI/PC solution
(green) as a function of concentration, showing that the polymer electrolyte is
higher in conductivity at high concentrations (data are mean ± SD, N = 3).
(G) Diffusion coefficients of Li⁺ (pink) and TFSI⁻ (blue) ions in the polymer electrolyte,
measured by pulsed-field-gradient NMR, showing that TFSI⁻ is more diffusive
at low concentrations, and Li⁺ at higher concentrations (data are mean ± SD, N = 3).
(H) Peak voltages generated by 0.1-Hz, 5% sinusoidal compressive strain in PVDF-HFP
solid polymer electrolytes with PC/LiTFSI at varying concentrations, showing a
reversal in sense voltage at a concentration similar to that of the diffusion crossover
point (data are mean ± SD, N = 3). (I) Piezoionic transient response under a step
compression from 15 and 30% pAAm swollen in 1.5 M NaCl. Note the faster decaying
response both at open circuit and short circuit when the polymer content is lower.
(J) Piezoionic transient response under repeated step compressions onto a 200-μm-thick
(top) and a 1-mm-thick (bottom) pAAm gel swollen with 1.5 M NaCl, exhibiting
decay times of ~50 ms and ~5 s, respectively, emulating rapid- and slow-adapting
mechanoreceptors. (K) Peak current and power transfer as a function of resistive load,
using a pAAm gel swollen with 1.5 M NaCl (data are mean ± SD, N = 3). (L) Energy
harvested and electromechanical coupling as a function of external load, using a pAAm
gel swollen with 1.5 M NaCl (data are mean ± SD, N = 3).
the pores. Nonetheless, the derived expression
for the piezoionic coefficient provides useful
predictions regarding the effects of perme- 
ability, viscosity, ion size, concentration, and 
conductivity. For example, increased ion con- 
centration leads to larger sense voltage (Fig. 2E). 
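
As a rough illustration of how the sign of the response follows from the diffusion coefficients, the sketch below assumes that the coefficient takes the form α ∝ (eNκ/ση)(D₊/D_o₊ − D₋/D_o₋), with σ the ionic conductivity. This form is an assumption chosen because it satisfies the two limiting cases described above (no voltage when the matrix adds no drag, and none when both ions are hindered equally); the derivation itself is in the supplementary materials. The function name and every numerical input are hypothetical placeholders, not measured values.

```python
ELEMENTARY_CHARGE = 1.602176634e-19  # coulombs


def piezoionic_coefficient(n, kappa, sigma, eta,
                           d_plus, d_minus, d0_plus, d0_minus):
    """Assumed form: (e*N*kappa / (sigma*eta)) * (D+/Do+ - D-/Do-).

    n      : ion concentration (per cubic meter)
    kappa  : matrix permeability (square meters)
    sigma  : ionic conductivity (siemens per meter)
    eta    : fluid viscosity (pascal seconds)
    d_*    : diffusion coefficients in the matrix
    d0_*   : diffusion coefficients without polymer drag
    The result has units of V/Pa, up to an unknown constant of proportionality.
    """
    prefactor = ELEMENTARY_CHARGE * n * kappa / (sigma * eta)
    return prefactor * (d_plus / d0_plus - d_minus / d0_minus)


# Hypothetical placeholder parameters (1 M is about 6.022e26 ions per cubic meter).
params = dict(n=6.022e26, kappa=1e-18, sigma=1.0, eta=1e-3)

# Case 1: the matrix adds no drag to either ion, so no voltage is expected.
no_drag = piezoionic_coefficient(d_plus=2e-10, d_minus=1e-10,
                                 d0_plus=2e-10, d0_minus=1e-10, **params)

# Case 2: both ions are hindered by the same factor, so no voltage is expected.
equal_hindrance = piezoionic_coefficient(d_plus=1e-10, d_minus=0.5e-10,
                                         d0_plus=2e-10, d0_minus=1e-10, **params)

# Case 3 and 4: one ion is hindered more than the other; swapping which ion
# is more hindered flips the sign of the coefficient.
anion_hindered = piezoionic_coefficient(d_plus=2e-10, d_minus=0.5e-10,
                                        d0_plus=2e-10, d0_minus=1e-10, **params)
cation_hindered = piezoionic_coefficient(d_plus=1e-10, d_minus=1e-10,
                                         d0_plus=2e-10, d0_minus=1e-10, **params)
```

Under this assumed form, the sign reversal seen when the diffusion coefficients cross over corresponds to the bracketed difference changing sign; which electrode polarity that maps to depends on the measurement convention.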

A faster transient piezoionic response is ob- 
served in the pAAm-based hydrogel when 
polymer content is reduced (Fig. 2I). The
higher fluid permeability (fig. S5) again leads 
Fig. 3. Properties and operation of a 16-element piezoionic mechanoreceptor
array. (A) Mechanoreceptor unit offset voltage as a function of the
AA content, showing an increase in “resting” potential as the fixed
charge increases (data are mean ± SD, N = 3). (B) Indentation-induced peak
piezoionic voltages as a function of the AA content, showing that increased
AA content results in increased magnitude of the generated voltage (data are
mean ± SD, N = 3). (C) Peak generated voltage as a function of applied pressure
to a faster poroelastic time constant in the 15%
film than in the 30% film, as expected. A faster 
response is also observed as the gel thickness 
decreases. The current response time decreases 
from ~5 s at 1-mm thickness to 30 ms at 
0.2 mm (Fig. 2J; close-up view of current re- 
sponse in fig. S10), owing to the local pressure 
spreading over a larger area as the hydrogel 
thickness increases, and hence the pressure 
gradient decreasing, as supported by simulation
results in figs. S11 and S12, as well as
movies S11 to S13. The transient response 
time varies greatly, from 30 ms at 0.2-mm 
thickness to >10 s at 2-mm thickness (fig. S9), 
similar to the response range (3 ms to 2 s) for 
rapidly- to ultraslowly-adapting human mech- 
anoreceptors (28). Cutaneous mechanorecep- 
tors attain different frequency responses by 
using specialized encapsulations around nerve 
endings to spatiotemporally modulate the 
(data are mean ± SD, N = 3). Fit lines represent smoothing spline fits for
(A) to (C). (D) Photograph of a 16-element piezoionic mechanoreceptor array on
a wrist, demonstrating conformability. (E) Photograph and (F) corresponding
normalized voltage bar plot of the piezoionic mechanoreceptor array detecting a
single touch (movie S1). (G) Photograph and (H) corresponding normalized
voltage bar plot of the piezoionic mechanoreceptor array detecting multiple
receptor deformation upon a given perturbation
(28, 29), whereas piezoionic current
spikes are modified by permeability, modulus,
and dimensions (as described by the poroelastic
time constant; eq. S15).

Energy output of the piezoionic sensors is
maximized by using classical impedance matching.
Peak power transfer occurs when an external
resistor of 500 ohms is employed (Fig. 2K),
which matches the internal ionic resistance
of the sensor. Perhaps unsurprisingly, these
materials are not efficient energy harvesters.
Electromechanical coupling was highest at
0.3% (Fig. 2L), as measured by applying a 10%
compression at 1 Hz on a 1-mm-thick pAAm
hydrogel swollen with 1.5 M NaCl. In comparison
with stretchable piezoelectric and
triboelectric devices (3, 9), piezoionic voltage
Fig. 4. Demonstration of piezoionic neuromodulation in a rodent model. (A) Setup for the piezoionic
peripheral nerve stimulation experiment, showing electrode placement. (B) Area-normalized current
responses to a square wave (peak-to-peak voltage = 5 mV) applied to the stainless-steel electrodes (red) and
PEDOT-coated Pt-Ir electrodes (black) in a saline bath. Note the significant increase in the peak current
as well as the steady-state transients using the PEDOT electrodes. (C) Sensor voltage and EMG signal reading
during piezoionic stimulation, with stainless-steel electrodes at the sciatic nerve. The EMG signal lags the
sensor input by around 5 to 8 ms as a result of propagation delay. (D) Piezoionic tibial nerve stimulation. Currents
are generated by tapping on the piezoionic sensor directly, eliciting hindlimb movement (movie S5).
output is much lower, but peak power per
volume is similar at 0.85 μW/cm³ (Fig. 2L).
Further exploration of dimensions, as well 
as electrolyte and polymer content, creates 
opportunities for investigating the upper lim- 
its of power density in piezoionic materials. 
These should scale well as dimensions are 
reduced, because the poroelastic time con- 
stant (eq. S15) is proportional to the square 
of the dimensions. Unlike piezoelectric (30) 
and triboelectric (9) generators, piezoionics 
can yield very high charges: in this case,
80 μC/cm² or 800 μC/cm³. In PVDF, the charge
per area is only 20 pC/cm², or 200 pC/cm³, for
the same sample dimensions and applied pressure.
The highly stretchable electrets and
triboelectric generators produce 4 nC/cm²
and 0.01% of piezoionics in terms of charge per
volume (9, 30). Further comparisons of charge 
and power output are made with piezoelectric 
and triboelectric devices in the supplemen- 
tary materials. The high charge density of 
piezoionics is particularly attractive for use in 
neural interfaces, where charge injections at 
tens of microcoulombs per square centimeter 
are routinely required while maintenance of 
low voltages is a key safety requirement (31).
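
Two of the quantitative statements in this paragraph can be checked with a short calculation. The sketch below first uses the textbook result that a source with internal resistance R_i delivers the most power to a load when R_load = R_i, and then converts the quoted charge per area into charge per volume. The 500-ohm internal resistance and the charge densities come from the text; the open-circuit amplitude `V_OC` is a hypothetical placeholder, and the 1-mm sample thickness is an assumption inferred from the factor of 10 between the per-area and per-volume figures.

```python
def load_power(v_oc, r_internal, r_load):
    """Power dissipated in a resistive load driven through an internal resistance."""
    current = v_oc / (r_internal + r_load)
    return current ** 2 * r_load


R_INTERNAL = 500.0  # ohms, internal ionic resistance implied by the matched load
V_OC = 1e-3         # volts, hypothetical open-circuit amplitude (illustration only)

# Sweep candidate loads from 50 to 2000 ohms and pick the one with the highest power.
loads = [50.0 * k for k in range(1, 41)]
best_load = max(loads, key=lambda r: load_power(V_OC, R_INTERNAL, r))


def charge_per_volume(charge_per_area, thickness_cm):
    """Convert charge per unit area to charge per unit volume for a slab."""
    return charge_per_area / thickness_cm


THICKNESS_CM = 0.1  # assumed 1-mm-thick sample

piezoionic_uc_per_cm3 = charge_per_volume(80.0, THICKNESS_CM)  # from 80 uC/cm^2
pvdf_pc_per_cm3 = charge_per_volume(20.0, THICKNESS_CM)        # from 20 pC/cm^2

# Ratio of piezoionic to PVDF charge per area, both expressed in C/cm^2.
charge_ratio = 80.0e-6 / 20.0e-12
```

With these inputs the sweep selects the 500-ohm load, and the per-volume figures reproduce the 800 μC/cm³ and 200 pC/cm³ quoted above, a difference of roughly six to seven orders of magnitude between the two materials.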
Piezoionics’ similarities to mechanorecep- 
tors in terms of charge, voltage, and time 
response led us to explore the creation of a 
piezoionic skin that can interact with the 
nervous system. We began by emulating trans- 
membrane potential. Sodium-potassium pumps 
maintain a resting transmembrane potential 
of approximately −70 mV, which is partially
depolarized and repolarized in a mechano- 
receptor cell. The threshold potential change 
to produce an action potential is ~15 mV. To 
create a resting potential analog in the hy- 
drogels, a boundary between a region of fixed 
polymer backbone charge and one of uncharged 
matrix was built into a hydrogel (Fig. 1B). The
resulting artificial mechanoreceptor is com- 
posed of a 2-mm diameter, 2-mm-tall poly(AA- 
co-AAm) hemisphere that is surrounded in 
plane by pAAm swollen with 0.1 M KCl and 
formed into an array of 16 individually ad- 
dressable touch sensors on an 8-mm pitch 
(Fig. 3D). An orange dye is used to identify the 
acrylic AA regions in the transparent sensor. 
In this case, protons are the only mobile ions, 
released by the dissociation of the AA. The 
immobility of the negative charge on the AA 
yields a negative Donnan potential relative to 
the AAm region, as protons diffuse into the 
pAAm plane. By increasing the ratio of AA to 
AAm units in the charged region, this built-in 
“resting potential” becomes as large as -50 mV 
(Fig. 3A). By applying a light pressure (0.5-Hz 
compressive sinusoidal indentation at 20% 
strain, corresponding to ~20 kPa) over the 
hemispheres, displaced water entrains the pro- 
tons, and the potential becomes more negative, 
with larger response for a higher-AA content 
(Fig. 3B). The pressure sensitivity of a single
mechanoreceptor unit is 8 μV/kPa (α = 8 ×
10⁻⁹ V/Pa), probed with pressures of up to
360 kPa using a 0.1-Hz sinusoidal perturbation
(Fig. 3C). A 4-cm-by-4-cm sensor array was
tested under single- and multitouch conditions
(Fig. 3, E to H, and movies S1 and S2),
as well as in detection of a swiping gesture
(movie S3). A gentle finger press (~100 gram-force)
generates voltage changes of approximately
−10 mV, which are superimposed on
the −50-mV Donnan potential. Little crosstalk
between mechanoreceptors was observed.
By changing the film thickness or polymer con- 
tent or using current response instead of volt- 
age, the properties of this transient response— 
including the decay time, which is ~1 s with 
the current dimensions—can be made faster 
or slower. The desired time constant depends 
on the application. If the aim is to detect 
change but ignore steady forces, then a fast 
time response is desired—for example, creat- 
ing a situation in which the skin “forgets” 
about a constant force, just as we might stop 
being aware of clothing or a persistent noise. 
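
The single-unit pressure sensitivity quoted above can be converted between units and combined with the built-in offset in a few lines. The sketch below is only a unit-conversion aid: it assumes a linear response over the probed range, whereas the fits in Fig. 3 are smoothing splines, and the function and constant names are hypothetical.

```python
ALPHA_V_PER_PA = 8e-6 / 1e3   # 8 uV/kPa expressed in V/Pa
DONNAN_OFFSET_V = -50e-3      # built-in "resting" potential of about -50 mV


def sensor_voltage(pressure_pa, alpha=ALPHA_V_PER_PA, offset=DONNAN_OFFSET_V):
    """Offset plus an assumed linear pressure term.

    Compression makes the potential more negative, so the pressure
    contribution is subtracted from the (already negative) offset.
    """
    return offset - alpha * pressure_pa


# Change in voltage at the highest probed pressure of 360 kPa.
delta_v_at_max = ALPHA_V_PER_PA * 360e3
voltage_at_max = sensor_voltage(360e3)
```

The conversion confirms that 8 μV/kPa equals 8 × 10⁻⁹ V/Pa; under the linear assumption the shift at 360 kPa is a few millivolts on top of the offset.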

This piezoionic skin may be suitable for soft 
robotics, medical devices, or wearable appli- 
cations, given its mechanical compliance and 
self-powered nature. We demonstrate piezo- 
ionic detection of joint flexion and extension 
(fig. S27 and movie S6), for which the com- 
pliant nature makes it unobtrusive. We also 
detect the touch of a finger on skin and on a
toy (movie S7). To be effective in such dry ap- 
plications over hours or longer, the sensor 
needs a stretchable encapsulation layer. Eco- 
flex 00-30 was employed for this purpose (see 
supplementary materials), enabling up to sev- 
eral days of operation, which is sufficient for 
many wearable applications. Figures S29 and 
S30 demonstrate 390 cycles, where some drift 
is observed, returning to normal after rest- 
ing, without degradation in performance. 
Switching to styrene-isobutylene-styrene block 
copolymers with low rates of water-vapor 
transmission should allow for months or even 
years of operation without dehydration (32), 
as could the use of lithium salts or ionic liquids 
(33). Owing to the bioinert nature of many 
synthetic hydrogels, it is also conceivable to 
employ piezoionics in implantable devices. 
Other advantages are the simplicity of the 
device and its adaptable compliance. An arte- 
rial phantom (fig. S28) demonstrates the sen- 
sor detecting a pulsatile flow that simulates 
cerebrovascular hemodynamics (34). 

We performed peripheral nerve stimulation 
to demonstrate the possibility of self-powered 
piezoionic neuromodulation. A piezoionic sen- 
sor element composed of a pAAm hydrogel 
swollen in 1.5 M NaCl was pressed by hand to 
generate tactile-induced current pulses. Con- 
nections to the sciatic or tibial nerves of rodent 
models were made via conventional stainless-steel
needle electrodes (250-μm diameter)
or poly(3,4-ethylenedioxythiophene) (PEDOT)-coated
(~1 μm) Pt-Ir wires (75-μm diameter),
as shown in Fig. 4A. The PEDOT-coated Pt-Ir 
wires were gently wrapped around the tibial 
nerve, whereas the stainless-steel needles were 
placed by piercing the perineurium of the 
sciatic nerve. In either case, the distance be- 
tween the contacts was 2 to 4 mm. Because 
the charge capacity of metals is relatively 
small [~0.2 F/m² (35)], conducting polymer
PEDOT was used to increase charge injec- 
tion capacity (Fig. 4B), as discussed in the 
supplementary materials. An electromyogram 
(EMG) was concurrently recorded by inserting 
a pair of needle electrodes in the gastrocne- 
mius muscle. 

A series of presses was then applied to the 
piezoionic sensor connected to the peripheral 
nerves. Using the stainless-steel electrodes, we 
did not observe a visible twitch of the hindlimb
but did record a clear EMG signal, indicating
that the muscle was activated (Fig. 4C), although
not strongly enough to produce visible movement.
The EMG signal lagged the voltage generated 
by the sensor by ~8 ms, as expected given the 
conduction velocities. Nonetheless, amplifica- 
tion of the sensor signal via analog-to-digital 
conversion into biphasic pulse trains enabled 
observation of hindlimb twitches (movie S4). 
Connecting the PEDOT-coated Pt-Ir electrodes to
the nerve gave currents in excess
of 10 μA at well under 100 mV and produced
clearly visible hindlimb twitches without any
signal conditioning or amplification (Fig. 4D
and movie S5).

Piezoionics offer sensing solutions that are 
soft, self-powering, and biocompatible. In re- 
sponse to pressure, they can be tailored to pro- 
duce a wide temporal range of transient signals 
and provide much higher charges at much 
lower voltages than piezo- and triboelectric 
devices. Consequently, piezoionics are a good 
match for neural interfacing applications. 
Piezoionic devices are another iontronic tool, 
part of a system that, like nature, also includes 
computation, actuation, and energy storage. 
Food wanting is mediated by transient activation of
dopaminergic signaling in the honey bee brain

Jingnan Huang, Zhaonan Zhang, Wangjiang Feng, Yuanhong Zhao, Anna Aldanondo,
Maria Gabriela de Brito Sanchez, Marco Paoli, Angele Rolland, Zhiguo Li, Hongyi Nie, Yan Lin,
Shaowu Zhang, Martin Giurfa, Songkun Su


The biological bases of wanting have been characterized in mammals, but whether an equivalent 
wanting system exists in insects remains unknown. In this study, we focused on honey bees, which 
perform intensive foraging activities to satisfy colony needs, and sought to determine whether 
foragers leave the hive driven by specific expectations about reward and whether they recollect 
these expectations during their waggle dances. We monitored foraging and dance behavior and 
simultaneously quantified and interfered with biogenic amine signaling in the bee brain. We show 
that a dopamine-dependent wanting system is activated transiently in the bee brain by increased 
appetite and individual recollection of profitable food sources, both en route to the goal and during 
waggle dances. Our results show that insects share with mammals common neural mechanisms for 
encoding wanting of stimuli with positive hedonic value. 


The study of how brains encode pleasure
(1, 2) has uncovered the neural and
molecular underpinnings of hedonic
(i.e., pleasant) experiences such as
reward. Psychological components in
the processing of appetitive rewards have
been identified (3) and traced to dissociable
circuits in the mammalian brain (1): (i) “liking,”
which refers to the actual pleasurable impact 
of reward consumption; (ii) “wanting,” which 
refers to the motivation to reach the reward; 
and (iii) “learning,” which includes the im- 
plicit and explicit information about reward 
acquired through individual experience (4). 
In mammals, the process of wanting is mediated
by neural systems that include mesolimbic
dopamine (1, 5).

Despite the fact that invertebrates exhibit 
sophisticated reward-seeking behaviors (6), 
the existence of an equivalent wanting system 
driving food search remains unknown. Honey 
bees are an attractive organism for the study 
of this topic owing to their intensive foraging 
activities, their developed social organiza- 
tion, and the presence of a symbolic code for 
information transfer about appetitive rewards 
(7) known as the waggle dance, which conveys 
information about direction and distance of 
profitable food sources (8). The existence of 
this code allows us to explore whether, besides
communicating vectorial information point- 
ing toward a food source, dancers transiently 
recollect the hedonic properties of the ex- 
ploited food source. Does a wanting system 
drive foragers to profitable food sources and 
do waggle dances reactivate this system through 
the reminiscence of the reported food source? 
To answer these questions, we monitored 
foraging and dance behavior of identified 
foragers and simultaneously quantified and 
interfered with biogenic amine signaling in 
the bee brain during key phases of foraging 
and communication cycles. We demonstrate 
that dopamine levels increase in the brain of 
active dancing foragers both during dance 
initiation and upon departure toward an at- 
tractive food source. Individual hunger also 
elevates dopamine levels in the brain of 
foragers, thereby enhancing the hedonic value 
of sucrose solution and improving appetitive 
olfactory learning and memory retrieval. We 
thus provide evidence of the existence of a 
dopamine-based wanting system in the bee 
brain that is triggered by both individual 
hunger and personal recollection of the hedonic 
value of food sources in the context of the 
waggle dance. 

We first trained honey bees (Apis mellifera) 
from an observation hive to collect sucrose 
solution from an artificial feeder. Trained, 
tagged bees flew regularly between the hive 
and the feeder for a period of 3 days. On day 
three, bees consistently reporting the pres- 
ence of the feeder via regular waggle dances 
were identified. On day four, these bees were 
captured in different contexts and situations.
At the feeder, tagged dancers were
captured either upon arrival and before they 
started feeding or ~1 min after landing and 
starting feeding on the available sucrose solu- 
tion. Within the hive, dancers were captured 


either immediately after dance initiation, upon 
dance ending (typically 50 to 60 s after dance 
start), or when departing for the food source 
(typically 10 to 20 s after dance end). We also 
collected “cessation” dancers, that is, bees that 
danced when the feeder was available but 
stopped dancing after the feeder was kept empty
for 30 min.

Captured bees were immediately frozen in 
liquid nitrogen. High-performance liquid chro- 
matography was performed on individual 
brains to quantify biogenic amine levels (nano- 
grams per brain). Reliable measurements 
across all experimental conditions were ob- 
tained for dopamine (DA) and serotonin [5- 
hydroxytryptamine (5-HT)]. A highly significant 
interaction was found between the factors 
“biogenic amine” and “behavioral condition” 
(Fig. 1A) (F4,130 = 15.53, P < 0.0001). Whereas 
serotonin did not change with the phase of the 
foraging cycle or dance, significantly higher 
levels of dopamine were observed upon arrival 
at the food source before the bee started feed- 
ing and during waggle dance initiation (Tukey 
tests, P < 0.001). Although dopamine levels 
decreased at the end of dancing, foragers 
arrived at the feeder with increased levels of 
this amine. This increase was present upon 
departure from the hive, as shown by dopamine 
levels of foragers flying regularly between the 
hive and the feeder over the course of 3 days 
and captured on day four at the hive entrance 
after dance end and when preparing to leave 
the hive (Fig. 1B, “hive departure”). Equivalent 
levels of dopamine were found in these bees 
upon landing at the feeder (Fig. 1B, “feeder 
arrival”). Serotonin levels remained lower in 
both groups of bees. As a result, the inter- 
action between the factors “biogenic amine” 
and “behavioral condition” was not significant 
(F1,90 = 0.12, P = 0.73). A significant variation
was found for “biogenic amine” (F1,90 = 261.32,
P < 0.0001) but not for “behavioral condition”
(F1,90 = 0.49, P = 0.49). Overall, these results
show that brain levels of dopamine increase 
when foragers start dancing in the hive to 
convey information about profitable food sources 
but then transiently decrease toward the end 
of the dance. Departure toward the reported 
food source is again accompanied by an in- 
crease of dopamine brain levels, whereas food 
ingestion at the goal decreases levels until the 
next dancing event. These fast, transient acti- 
vations of the dopaminergic system could be 
indicative of an enhanced appetitive motivation 
upon food finding and recollection of food value 
through initiation of waggle dances. 
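
The degrees of freedom attached to the interaction tests can be cross-checked against the group sizes listed in the Fig. 1 caption. The sketch below assumes a fully between-subjects two-way design (each dopamine and serotonin measurement treated as an independent observation), which is an assumption about the analysis rather than something the text states; the helper name is hypothetical, and the group sizes are read from the caption.

```python
def two_way_df(n_levels_a, n_levels_b, group_sizes):
    """Return (df_interaction, df_error) for a between-subjects two-way ANOVA."""
    n_cells = n_levels_a * n_levels_b
    if len(group_sizes) != n_cells:
        raise ValueError("expected one sample size per cell of the design")
    df_interaction = (n_levels_a - 1) * (n_levels_b - 1)
    df_error = sum(group_sizes) - n_cells
    return df_interaction, df_error


# Fig. 1A: 2 amines x 5 behavioral conditions (DA and 5-HT sample sizes per condition).
fig_1a = two_way_df(2, 5, [11, 10, 15, 15, 15, 14, 15, 15, 15, 15])  # (4, 130)

# Fig. 1B: 2 amines x 2 behavioral conditions.
fig_1b = two_way_df(2, 2, [24, 24, 23, 23])  # (1, 90)
```

Under that assumption the caption sample sizes give 4 and 130 degrees of freedom for panel A and 1 and 90 for panel B.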

The increased levels of dopamine in the 
brain of foragers may have been influenced 
by their 3-day cumulative flight experience 
prior to biogenic amine quantification. We 
therefore wondered whether the first expe- 
rience with a profitable food source already 
enhanced dopamine levels. Marked bees were 
Fig. 1. Food expectation upon departure toward a food source and dance start increase dopamine
levels in the honey bee brain. The three panels show dopamine (DA, orange bars) and serotonin (5-HT,
green bars) levels (nanograms per brain; mean ± SEM) measured in individual brains of foragers belonging to
different behavioral categories. Lowercase letters (a and b) above the bars indicate significant differences
(Tukey tests; P < 0.05). Scatter plots show individual data distribution. (A) “Feeder arrival” corresponds to
foragers that were observed dancing within the hive and that were captured upon arrival at the feeder
reported in their dances and before they started feeding (DA: n = 11; 5-HT: n = 10). “Feeder feeding”
corresponds to the same kind of bees captured 1 min after they landed on the feeder and started feeding
(DA: n = 15; 5-HT: n = 15). “Dancing start” corresponds to bees that started dancing right after their
return to the hive from the feeder and that were captured upon dance initiation (DA: n = 15; 5-HT: n = 14).
“Dancing end” refers to the same kind of bees captured upon dance ending, ~1 min after dance start
(DA: n = 15; 5-HT: n = 15). “Dancing cessation” refers to bees that stopped dancing after food was made
unavailable for 30 min (DA: n = 15; 5-HT: n = 15). (B) “Hive departure” corresponds to foragers captured
upon departure from the hive after dancing (DA: n = 24; 5-HT: n = 24). “Feeder arrival” corresponds to
the same kind of bees captured upon arrival at the feeder and before they started feeding (DA: n = 23; 5-HT:
n = 23). (C) “First arrival” shows foraging bees arriving at the feeder in their first active flight after prior
appetitive experience at that location (DA: n = 19; 5-HT: n = 19). “First feeding” shows the same kind of
bees captured 1 min after they landed on the feeder and started feeding (DA: n = 13; 5-HT: n = 18).



captured upon their first active visit (i.e., they 
flew on their own rather than being trans- 
ported by the experimenter) to a feeder to which 
they were previously brought. Capture occurred 
either immediately upon arrival at the feeder 
or 1 min after landing and feeding. Figure 1C 
shows a significant interaction between the 
factors “biogenic amine” and “behavioral condition”
(F1,65 = 16.36, P < 0.001). Whereas
serotonin levels did not vary between the two 
experimental conditions (Tukey tests, not sig- 
nificant), dopamine levels were higher in bees 
arriving at the feeder for the first time (P < 
0.001). Thus, specific expectations about prof- 
itable rewards are accompanied by high dopa- 
mine levels, irrespective of the amount of flight 
experience. 

Having shown that waggle dances tran- 
siently enhance dopamine levels of dancers, 
we sought to determine whether the same 
effect was observable in dance followers that 
had never visited the feeder. We monitored 


waggle dances of marked foragers reporting 
the location of a trained feeder and captured
nonmarked followers (i.e., bees with no feeder 
experience), either within 5 s after they started 
following the dancer (“short-term followers”) or 
after 50 to 60 s (“long-term followers”). We also 
captured dancers in the middle of their dances 
(typically 20 to 30 s after dance start). We mea- 
sured brain levels of biogenic amines and found 
that the interaction between the factors “bio- 
genic amine” and “behavioral category” was 
not significant (fig. S1; F2,100 = 0.17, P = 0.85).
Significant differences were found between
dopamine and serotonin (F1,100 = 41.26, P <
0.0001) and between bee categories (F2,100 =
16.38, P < 0.0001). Dopamine levels were
higher in dancers and increased in followers 
from the start to the end of the dance, yet this 
increase was not significant (Tukey tests). A 
similar trend was found for serotonin. Thus, 
dances might not activate per se a dopamine- 
based wanting system in followers. 

To confirm the presence of a dopamine- 
based wanting system in foragers, we interfered 


with dopaminergic signaling and determined 
the effect of this interference on foraging. 
We trained marked foragers to collect sucrose 
solution from a feeder and measured their 
frequency of foraging bouts and their return 
time, that is, the length of time between two 
consecutive visits to the feeder, as these vari- 
ables are sensitive indicators of appetitive 
motivation (9, 10). Measurements were per- 
formed 30 min before topical delivery on the 
thorax (11) of the dopamine receptor antagonist
fluphenazine (12, 13), the dopamine
receptor agonist 6,7-ADTN hydrobromide
(hereafter, 6,7-ADTN) (14-18), or the solvent
N,N'-dimethylformamide (DMF). Another con- 
trol group remained untreated (“sham”). After 
20 min, the same foraging variables were mea- 
sured during a second 30-min period. The 
interaction between “treatment” and “mea- 
surement time” was not significant for the 
frequency of foraging bouts (Fig. 2A; analysis 
of variance for repeated measurements; F3 5 = 
1.57, P = 0.20). However, a significant variation 
was observed between measurement times 
(before versus after: Figo = 29.73, P < 0.0001) 
and between treatments (F3 55 = 3.40, P < 0.05) 
as the antagonist fluphenazine decreased the 
frequency of bouts (Tukey test, P < 0.01). In 
the case of the return time, the interaction 
between “treatment” and “measurement time” 
was significant (Fig. 2B; F3,82 = 7.49, P < 0.001),
and fluphenazine was again responsible for 
prolonging it (P < 0.001). The lack of effect of 
6,7-ADTN on both variables measured may have 
been caused by a ceiling foraging motivation. 
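
As an illustration only, the two foraging variables defined above (frequency of foraging bouts and return time) can be derived from a feeder log. The sketch below assumes a hypothetical record of (arrival, departure) times in minutes for one marked bee during one 30-min observation period; the function name and data layout are assumptions for illustration and are not taken from the study.

```python
from statistics import mean


def foraging_metrics(visits, period_min=30.0):
    """Summarize one bee's feeder log for one observation period.

    visits: list of (arrival_min, departure_min) tuples sorted by time
            (hypothetical layout, not taken from the study).
    """
    n_bouts = len(visits)
    # Return time: time spent away from the feeder between leaving it
    # and arriving for the next visit.
    return_times = [nxt[0] - cur[1] for cur, nxt in zip(visits, visits[1:])]
    return {
        "bouts": n_bouts,
        "bouts_per_min": n_bouts / period_min,
        "mean_return_time_min": mean(return_times) if return_times else None,
    }


example = foraging_metrics([(0.0, 1.0), (5.0, 6.0), (11.0, 12.0)])
print(example)
```

Comparing these per-bee summaries before and after treatment is what the repeated-measures analysis in the text operates on.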

As fluphenazine might have affected motor 
activity (19) rather than foraging motiva- 
tion, we repeated the previous experiment 
but decomposed the return time into (i) the 
time spent flying from the feeder to the hive, 
(ii) the time spent in the hive, and (iii) the time
spent flying from the hive to the feeder. Foraging 
times were quantified in marked foragers before 
and after topical exposure to DMF, fluphena- 
zine, or 6,7-ADTN. An untreated sham group 
was also included. Neither the time spent fly- 
ing from the feeder to the hive (Fig. 2C; inter- 
action between “time” and “group” factors: 
F3,73 = 1.92, P = 0.13) nor the time spent flying
from the hive to the feeder (Fig. 2D; interaction
between “time” and “group” factors:
F3,73 = 2.26, P = 0.09) varied before and after
treatment or between groups. On the contrary,
the time spent within the hive exhibited a
significant interaction (Fig. 2E; F3,73 = 8.01,
P < 0.001). Specifically, fluphenazine in- 
creased the time spent in the hive (Tukey test, 
P < 0.001). Thus, blockade of dopaminergic 
signaling had no effect on flight activity but 
prolonged the time spent in the hive at the 
expense of foraging. 

Fig. 2. Pharmacological blockade of dopaminergic signaling in honey bee
foragers decreases the frequency of foraging and increases the time
between successive visits to the feeder. Honey bees received a topical
application on the thorax of either fluphenazine, 6,7-ADTN, or DMF or remained
untreated (sham). The frequency of foraging bouts (A) and the time spent away
from the feeder between two consecutive visits (return time) (B) were quantified
during two 30-min periods, one before and one 20 min after topical exposure
(6,7-ADTN: n = 22; fluphenazine: n = 24; DMF: n = 18; sham: n = 22). Treatment
with fluphenazine significantly decreased the frequency of foraging bouts and
increased the return time, thus revealing a negative impact on foraging
motivation. (C to E) Pharmacological blockade of dopaminergic signaling in
honey bee foragers increases the time spent in the hive at the expense of
foraging. Neither the time spent flying from the feeder to the hive (C) nor from
the hive to the feeder (D) was affected by the pharmacological treatments. On
the contrary, blockade of DA signaling increased the time spent in the hive at
the expense of the foraging time (E) (6,7-ADTN: n = 22; fluphenazine: n = 20; DMF:
n = 22; sham: n = 13). In all panels, the values shown correspond to the mean ± SEM.
**P < 0.001; ns, not significant; Tukey tests.

As honey bee foraging is determined by colony needs rather than by individual
hunger (20), we asked whether individual appetitive needs also activate a
dopamine wanting system. To this end, we studied whether starvation of
individual foragers enhanced dopamine brain levels. Marked foraging bees of
controlled age were captured at an artificial feeder and harnessed individually
[see supplementary materials (SM) for details]. Bees were housed in an
incubator for 1 or 2 hours and then frozen in liquid nitrogen for quantifying
dopamine and serotonin brain levels. The interaction between the factors
“biogenic amine” and “starvation time” was significant (Fig. 3A; F1,159 =
7.25, P < 0.01), as only dopamine (Tukey tests, P < 0.05), not serotonin,
levels increased with starvation time. This increase is consistent with the
activation of an individual wanting system.

We reasoned that manipulating dopamine levels would change the hedonic value
of sucrose solution and thus individual appetitive responsiveness. To test this
hypothesis, foragers captured upon landing at a feeder were individually
enclosed within syringes, which allowed them to be fed with 30 µl of 1.0 M
sucrose solution via a tip inserted in the syringe hub (21). In this way, we
reduced individual appetitive responsiveness to be able to observe behavioral
changes due to artificial dopamine increase. After feeding, bees were topically
exposed to dopamine, 6,7-ADTN, or the solvent DMF. A sham group was also
included. Bees were then individually harnessed and left at rest (see SM for
details). They were then stimulated with six increasing concentrations of
sucrose solution delivered to the antennae (0.1, 0.3, 1, 3, 10, and 30%, w/w)
(22, 23). The appetitive proboscis extension response to these stimuli was
quantified, and an individual sucrose responsiveness score was calculated on
the basis of the number of sucrose concentrations that elicited proboscis
extension. Appetitive responsiveness varied between treatments (Fig. 3B;
Kruskal-Wallis test; H5,225 = 57.55, P < 0.0001). Whereas the two control
groups (sham and DMF) showed similarly low responsiveness scores (mean rank
comparison, not significant), the highest dose of both dopamine and 6,7-ADTN
significantly increased sucrose scores (P < 0.0001). We confirmed this finding
in bees availing themselves of an even higher energy supplement as they were
individually fed with a mixture of honey, pollen, and sucrose solution (24-26)
before the sucrose-responsiveness assay. Enhancing dopamine
levels in these bees rescued their appetitive 
responsiveness (fig. S2; H3 41; = 130.08, P < 
0.0001), as their sucrose scores were sig- 
nificantly higher than those of the fed controls 
(P < 0.05). 
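
The sucrose responsiveness score described above is a simple count, which a short sketch can make concrete. The six concentrations are those listed in the text; the function name and the boolean encoding of the proboscis extension response are assumptions made for illustration.

```python
# Sucrose concentrations (% w/w) presented to the antennae, as listed in the text.
SUCROSE_CONCENTRATIONS = (0.1, 0.3, 1, 3, 10, 30)


def sucrose_responsiveness_score(responses):
    """Count the concentrations that elicited proboscis extension.

    responses: one truthy/falsy value per concentration, in presentation
               order (hypothetical encoding, not taken from the study).
    """
    if len(responses) != len(SUCROSE_CONCENTRATIONS):
        raise ValueError("expected one response per sucrose concentration")
    return sum(1 for responded in responses if responded)


# A bee that starts responding at 1% sucrose and responds to all higher doses.
print(sucrose_responsiveness_score([False, False, True, True, True, True]))
```

Scores therefore range from 0 (no response at any concentration) to 6 (response at every concentration), and higher scores indicate higher appetitive motivation.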

We then tested whether manipulating dopa- 
mine would affect learning and retrieval of 
odorants predicting food. We took advantage 
of the olfactory conditioning of the proboscis 
extension response in which harnessed bees 
learn to associate a neutral odorant with a 
reward of sucrose solution (27, 28). Foragers 
captured upon landing at a feeder were handled 
as in the previous experiments and topically 
exposed to DMF, the higher dose of dopamine, 
or the dopamine receptor antagonist flupen- 
tixol (13, 29). An additional control was left 
untreated (sham). Each group was trained to 
discriminate 1-nonanol from 1-hexanol (30) 
(see SM for details). 

Fig. 3. An individual dopamine-dependent wanting system drives both appetitive
responsiveness and appetitive learning and memory in honey bees.
(A) Increased starvation enhances dopamine levels in the brain of foragers.
Dopamine (DA) and serotonin (5-HT) measurements were performed in harnessed
23-day-old bees after immobilizing them and subjecting them to starvation
periods of 1 or 2 hours (DA1h: n = 42; DA2h: n = 41; 5-HT1h: n = 39;
5-HT2h: n = 41). Increased hunger translates into higher DA levels in the bee
brain. Lowercase letters (a, b, and c) above the bars indicate significant
differences between groups (Tukey tests, P < 0.05).
(B) Artificial increase of DA levels in honey bee foragers enhances their
appetitive responsiveness to sucrose solution. Honey bee foragers captured at a
feeder upon landing and before they started feeding were fed 30 µl of 1.0 M
sucrose solution and then topically exposed on the thorax to one of two doses
of either dopamine (DA1: n = 29; DA2: n = 30) or 6,7-ADTN (ADTN1: n = 34;
ADTN2: n = 38). Control bees were treated with DMF (n = 47) or remained
untreated (sham; n = 47). A sucrose responsiveness score (SRS) was computed for
each bee subjected to an increasing series of six sucrose concentrations on the
basis of its appetitive proboscis extension response (PER) in response to these
stimulations. Higher SRSs indicate higher appetitive motivation. The highest
dose of both DA and 6,7-ADTN significantly increased SRSs, thus revealing a
higher appetitive motivation. For each treatment, the figure shows the median,
10th, 25th, 75th, and 90th percentiles as box plots with error bars; outliers
are shown as 5th and 95th percentiles; the blue line within each box plot shows
the mean. Lowercase letters (a and b) above the bars indicate significant
differences between groups (Kruskal-Wallis test, post hoc mean rank
comparisons, P < 0.05).
(C) Artificial increase of DA levels in honey bee foragers enhances their
appetitive olfactory learning. Honey bee foragers captured at a feeder upon
landing and before they started feeding were harnessed and then topically
exposed on the thorax to either dopamine (n = 48) or flupentixol (n = 54).
Control bees were treated with DMF (n = 44) or remained untreated (sham;
n = 48). Bees were trained along six trials to discriminate two odorants, one
rewarded (CS+) and another not rewarded (CS-), using the olfactory conditioning
of the PER. An acquisition score (ACQS) was computed for each bee on the basis
of its responses to the conditioned odorants in the last two CS+ and last two
CS- trials. Higher positive ACQSs indicate better learning performances. Bees
in which DA was increased artificially demonstrated improved learning
performance and exhibited significantly higher ACQSs. For each treatment, the
figure shows the median, 10th, 25th, 75th, and 90th percentiles as box plots
with error bars; outliers are shown as 5th and 95th percentiles; the blue line
within each box plot shows the mean. Lowercase letters (a and b) above the bars
indicate significant differences between groups (Kruskal-Wallis test, post hoc
mean rank comparisons, P < 0.05).
(D) Artificial increase of DA levels in honey bee foragers enhances their
appetitive olfactory memory retrieval. Honey bee foragers captured at a feeder
upon landing but before they started feeding were harnessed and trained to
discriminate two odorants, one rewarded (CS+) and another not rewarded (CS-),
using the olfactory conditioning of the PER. Bees that managed a successful
discrimination in the last conditioning trial (i.e., responding to the CS+ and
not to the CS-) were then topically exposed on the thorax to either dopamine
(n = 33) or flupentixol (n = 33). Control bees were treated with DMF (n = 33)
or remained untreated (sham; n = 32). Memory retrieval was tested 1 hour after
the last conditioning trial and 30 min after the topical exposure by presenting
bees with the CS+ and the CS- in the absence of reward. The percentage of bees
exhibiting specific memory (i.e., responding to the CS+ and not to the CS-)
differed between treatments. It was significantly higher in the group treated
with DA. Lowercase letters (a and b) above the bars indicate significant
differences between groups (χ² contingency analysis, P < 0.005).

All four groups learned the olfactory discrimination (fig. S3). To compare
group performance, we calculated for each bee an acquisition score over the
last four trials, which could vary between 2 and -2, with a higher positive
score indicating successful learning. Scores varied significantly between
treatments (Fig. 3C; H3,194 = 17.47, P < 0.0001). Bees with elevated dopamine
levels learned better than the two control groups (Fig. 3C; P < 0.05) and the
group treated with flupentixol (Fig. 3C; P < 0.01). Treatment with flupentixol
tended to reduce discrimination at the end of training (fig. S3D), but the
corresponding score did not differ from that of
controls. 
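
A minimal sketch of how such an acquisition score could be computed is given here. It assumes, consistent with the stated range of 2 to -2 and with the Fig. 3C legend, that the score is the number of responses in the last two CS+ trials minus the number of responses in the last two CS- trials; that exact formula, the 0/1 encoding, and the function names are assumptions for illustration. The helper for specific memory follows the criterion stated in the Fig. 3D legend (responding to the CS+ and not to the CS-).

```python
def acquisition_score(cs_plus, cs_minus):
    """Responses in the last two CS+ trials minus those in the last two CS- trials.

    cs_plus, cs_minus: per-trial responses coded 1 (proboscis extension) or 0,
                       in training order (hypothetical encoding).
    The result lies between -2 and 2; higher values indicate better discrimination.
    """
    if len(cs_plus) < 2 or len(cs_minus) < 2:
        raise ValueError("need at least two trials of each type")
    return sum(cs_plus[-2:]) - sum(cs_minus[-2:])


def shows_specific_memory(responds_to_cs_plus, responds_to_cs_minus):
    """True if the bee responds to the rewarded odorant only."""
    return bool(responds_to_cs_plus) and not bool(responds_to_cs_minus)


# A bee that ends training responding to the CS+ only.
print(acquisition_score([0, 1, 1], [1, 0, 0]))
```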
Finally, we determined whether manipulating
dopamine brain levels would affect retrieval
of an appetitive memory evoked 1 hour after the 
last conditioning trial. After conditioning for- 
agers to discriminate the same previous two 
odorants, individuals that responded to the 
positive but not to the negative odorant in 
the last conditioning trial (fig. S4) were cold-anesthetized
and assigned to different topical
treatments (dopamine, flupentixol, or DMF).
An additional group remained untreated (sham) 
(see SM for more details). Bees were then tested 
with both odorants, and the percentage of bees 
exhibiting specific memory [i.e., responding 
to the positive but not to the negative odorant 
(30)] was quantified for each group. This percentage
varied significantly with the treatment
(Fig. 3D; χ² = 12.47, df: 3, P < 0.005), as
the group topically exposed to dopamine had 
a better retrieval performance than the other 
groups, which exhibited no differences in retrieval
(χ² = 4.05, df: 2, not significant). Importantly,
altering serotonin levels affected
neither sucrose responsiveness (fig. S5A) nor 
olfactory appetitive learning (fig. S5B). Thus, 
the changes induced in appetitive responsive- 
ness, learning, and memory are specific to 
dopaminergic signaling and point to a driving 
role for dopamine in an individual wanting 
system. 

Our results suggest that distinct food-related 
motivational and communication states deter- 
mine different levels of dopamine in the brain 
of honey bee foragers, consistent with the 
existence of a dopamine-based wanting sys- 
tem activated by both colony and individual 
appetitive needs. Foragers driven by an en- 
hanced motivation for food exhibit higher 
dopamine levels. Accordingly, blockade of 
dopaminergic signaling results in a decrease 
of foraging activities, consistent with the inhibition
of a wanting system mediated by
dopaminergic signaling (1, 4, 5). Dopamine
levels also increase when foragers report dis- 
tance and direction of a profitable food source 
by means of the waggle dance and upon de- 
parture toward the appetitive goal. This finding 
suggests that dancers do not only report the 
vectorial information leading to the food source 
but also recollect transiently the appetitive 
properties of the food source at the start of 
dancing and when leaving the hive, thereby 
elevating dopamine levels in their brain. The 
waggle dance is thus more than an innate, 
automatic behavior (7), as it sets the frame- 
work for a dancer’s transient evocation of 
food-source properties. 

In addition to this socially embedded want- 
ing system, individual hunger also leads to 
an increase in the level of dopamine in the 
forager brain. Accordingly, artificial elevation 
of dopamine increases the incentive value of 
sucrose solution and individual appetitive re- 
sponsiveness and improves olfactory learning 
and memory retrieval. These results are consistent
with alliesthesia, the modulation of incentive
value by relevant physiological state (31), as, for
instance, the improvement of memory retrieval 
upon dopamine elevation and in the absence 
of reward indicates that the incentive salience 
of a learned odor can be enhanced through an 
activation of the wanting system. Honey bees, 
unlike some mammals, are strictly eusocial 
individuals with a sophisticated division of
labor (20). As a result, their wanting sys- 
tem integrates both the social and the indi- 
vidual levels. 

A neuroscience of pleasure has revealed that 
several common principles of the neural encod- 
ing of stimuli with positive hedonic value are 
shared among mammals (7). The presence of 
a dopamine wanting system in the honey bee 
brain suggests that a precursor to mammalian 
wanting systems may have evolved very early, 
that is, 220 million years ago, dating back 
to the oldest currently known Hymenoptera 
fossils (32). 

EPIDEMIOLOGY

Rabies shows how scale of transmission can enable
acute infections to persist at low prevalence

Rebecca Mancy, Malavika Rajeev, Ahmed Lugelo, Kirstyn Brunker, Sarah Cleaveland,
Elaine A. Ferguson, Karen Hotopp, Rudovick Kazwala, Matthias Magoto, Kristyna Rysava,
Daniel T. Haydon, Katie Hampson


How acute pathogens persist and what curtails their epidemic growth in the absence of acquired immunity 
remains unknown. Canine rabies is a fatal zoonosis that circulates endemically at low prevalence among 
domestic dogs in low- and middle-income countries. We traced rabies transmission in a population of 
50,000 dogs in Tanzania from 2002 to 2016 and applied individual-based models to these spatially 
resolved data to investigate the mechanisms modulating transmission and the scale over which they 
operate. Although rabies prevalence never exceeded 0.15%, the best-fitting models demonstrated 
appreciable depletion of susceptible animals that occurred at local scales because of clusters of deaths and 
dogs already incubating infection. Individual variation in rabid dog behavior facilitated virus dispersal and 
cocirculation of virus lineages, enabling metapopulation persistence. These mechanisms have important 
implications for prediction and control of pathogens that circulate in spatially structured populations. 


Understanding the processes that regulate
endemic disease dynamics remains
a long-standing challenge in epidemiology (1, 2),
and the mechanisms that
enable long-term persistence at low prevalence
remain largely unexplored (3). This
is particularly true for canine rabies, a fatal 
zoonotic virus for which naturally acquired 
immunity has not been demonstrated. The 
basic reproductive number R0 of rabies, which
is defined as the expected number of sec- 
ondary cases produced by a typical infectious 
individual in a fully susceptible population 
(4), is low (between 1.1 and 2) and is relatively 
insensitive to dog density (5), making the dis- 
ease amenable to elimination through dog vac- 
cination (6). Yet dog-mediated rabies remains 
endemic across Africa and Asia, where it kills 
tens of thousands of people every year (7), and 
its persistent circulation at such low prevalence
in largely unvaccinated populations is
an enduring enigma. 

Rabies is, however, a usually tractable sys- 
tem for understanding how population-level 
patterns of infection emerge from pathogen 
transmission at the individual level. Rabies is 
transmitted through bites, which can often be 
observed, and the clinical signs are readily 
identifiable, with infected animals typically 
dying within 1 week of disease onset (Fig. 1E). 
Capitalizing on these distinctive characteris- 
tics, we conducted exhaustive contact tracing 
to generate spatially resolved data on rabies 
infection and transmission in Serengeti district, 
Northern Tanzania, between January 2002 
and December 2015. Serengeti district adjoins 
other populated districts to the north and west 
and Serengeti National Park to the southeast 
(Fig. 1A). We previously showed that domestic 
dogs maintain rabies in this part of Tanzania, 
with infrequent spillover to wildlife and spill- 
back to domestic dogs (8). In Serengeti dis- 
trict’s population of around 50,000 dogs (and 
250,000 people), we traced 3612 rabies infec- 
tions (comprising 3081 cases in dogs, 75 in 
cats, 145 in wildlife, and 311 in livestock) (Fig. 1), 
along with 6684 potential transmission events 
to other animals and 1462 people bitten by 
rabid animals, of whom 44 died from rabies. 
Most identified cases could be statistically
linked to plausible progenitors, indicating that
contact tracing detected most cases. Further
analysis indicated that 83 to 95% of cases
were detected, with missed cases mainly being
those generating limited, if any, onward transmission
(9). These data show that despite local
vaccination effort (coverage varying between
10 and 40%) (fig. S3B), rabies circulated continuously,
with a maximum prevalence of just 0.15%.

Fig. 1. Rabies in Serengeti district. (A) Mapped cases are in red. Shading
indicates dog density, and lines indicate village boundaries. (Inset) The district
location in Tanzania. (B) Monthly time series of cases in domestic dogs (red; n =
3081) and other carnivores (gray, n = 214). Species are detailed in (9). (C) Dogs
bitten per rabid dog versus dog density at each rabid dog's location (1-km² scale),
showing no apparent relationship. The red line indicates the generalized additive
model prediction, and gray lines indicate the standard error; P > 0.05. (Inset) The
proportional distribution of the dog population and of case locations in relation
to dog density on a log scale. Squares indicate mean dog population density (black;
23 dogs/km²) and mean dog density at case locations (red; 41 dogs/km²), indicating
higher per capita incidence in higher-density areas (independent samples t test
on log-transformed data, T = 22.45, P < 2.2 × 10⁻¹⁶). (D and E) Distributions of (D)
rabid dog step-lengths between contacts and (inset) distances to contacts and
(E) serial intervals and (inset) infectious periods. The best-fitting distributions are
in red (table S1), with step-lengths and distances censored for animals with unknown
starting locations (9). (F) Dogs bitten per rabid dog from contact tracing (gray)
and simulated from the individual-based model (red; 5th to 95th percentiles). The
y axis is square root transformed in (D) and (F) to better illustrate extreme values.

Endemic diseases are thought to be primarily
regulated by the depletion of susceptible
hosts, typically through disease-induced (or
vaccine-acquired) immunity, counterbalanced
by births of susceptibles and deaths (4). Yet
the very low prevalence and absence of acquired
immunity for rabies indicates that
large-scale depletion of susceptible hosts is
negligible, challenging this explanation as a
mechanism for persistence. We estimated dog
densities at high spatial resolution through a
district-wide census, georeferencing almost
36,000 households and recording the vaccination
status of dogs [in this setting, almost
all dogs are owned but also free-roaming,
and there are no feral dogs (10)]. Although
we saw no clear relationship between dog
population density and contact rate when
examining dogs bitten per rabid dog (Fig.
1C), mapping rabies infections revealed a
small yet significantly higher incidence of
cases in higher-density areas (Fig. 1C), which
suggests density-dependent processes. These
observations are difficult to reconcile: How
does transmission respond to dog population
density, and what processes keep prevalence
so low?

We propose that understanding the fine-scale
structure of rabies transmission networks
is critical to explaining its persistent dynamics
and can inform its control and eventual elimi- 
nation. We used the serial interval distribution— 
defined as the interval between the onset of 
infection in primary and secondary cases— 
and movement of traced rabid dogs to recon- 
struct transmission trees, comprising putative 
introductions into the district and descendent 
chains of transmission. Over the 14 years, we 
estimated around 238 introductions (8 to 
24 per year) that led to onward circulation, 
most likely spreading from neighboring villages
(movie S1). Twenty-two transmission
chains, accounting for >70% of cases, circulated
for more than 12 months (including two
for more than 4 years), illustrating how cocirculation
of lineages contributes to persistence
within a metapopulation (Fig. 2).

Fig. 2. The spatiotemporal distribution of transmission chains. (A) Monthly cases, estimated to be between
83 and 95% of all cases in the district. The 11 chains with most cases are highlighted in color, and all smaller
chains (<58 cases) are shaded gray. (B) Spatial distribution of monthly cases, from the consensus tree of
1000 bootstrapped reconstructions (9).

Fig. 3. Time series of rabies cases and simulated counterfactual scenarios. Observed cases (red), with
interquartile (dark shading) and 95% (light shading) prediction intervals, both computed pointwise, from simulations
at the 1-km² scale, and with three illustrative example runs (dark lines). The simulated scenarios are (A) with
vaccination campaigns as implemented, (B) under low vaccination coverage, (C) with vaccination campaigns as
implemented but no individual heterogeneity in contact parameters, and (D) without incursions after the first year.

We applied an individual-based model, seeded
by these introductions, to the spatially resolved
case and dog density data to investigate the
processes that modulate transmission, the
scale over which they operate, and how they
facilitate these metapopulation dynamics. 
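
To make the idea of tree reconstruction from serial intervals and movement concrete, the following is a deliberately simplified sketch, not the authors' algorithm (which is described in their supplementary materials and uses bootstrapped reconstructions). It assumes a gamma-distributed serial interval and an exponential distance kernel with made-up parameter values, and it assigns each case to the single most likely earlier case. All names, parameters, and the record layout are hypothetical.

```python
import math


def gamma_pdf(x, shape, scale):
    """Density of a gamma distribution; zero for non-positive x."""
    if x <= 0:
        return 0.0
    return x ** (shape - 1) * math.exp(-x / scale) / (math.gamma(shape) * scale ** shape)


def most_likely_progenitor(case, candidates,
                           si_shape=1.5, si_scale=17.0, dist_scale_km=1.0):
    """Pick the candidate whose timing and location best explain `case`.

    case, candidates: dicts with 'id', 'onset_day', 'x_km', 'y_km'
                      (hypothetical record layout).
    Parameter defaults are illustrative placeholders, not fitted values.
    Returns None when no candidate precedes the case, which a fuller
    analysis might treat as a putative introduction.
    """
    best_id, best_score = None, 0.0
    for cand in candidates:
        interval = case["onset_day"] - cand["onset_day"]
        dist = math.hypot(case["x_km"] - cand["x_km"], case["y_km"] - cand["y_km"])
        # Joint score: serial-interval density times an exponential distance kernel.
        score = gamma_pdf(interval, si_shape, si_scale) \
            * math.exp(-dist / dist_scale_km) / dist_scale_km
        if score > best_score:
            best_id, best_score = cand["id"], score
    return best_id


case = {"id": "c", "onset_day": 30, "x_km": 0.0, "y_km": 0.0}
earlier = [
    {"id": "near", "onset_day": 10, "x_km": 0.5, "y_km": 0.0},
    {"id": "far", "onset_day": 29, "x_km": 20.0, "y_km": 0.0},
    {"id": "later", "onset_day": 40, "x_km": 0.1, "y_km": 0.0},
]
print(most_likely_progenitor(case, earlier))
```

Repeating this assignment over all cases yields chains of transmission; cases without a plausible progenitor start new chains.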

We examined the effect of host (dog) density
on contact (biting) within our stochastic
individual-based model, drawing from predator-prey
functional response theory (11), to fit parameters
defining the transmission process in
relation to host density (12). We fitted the parameters
assuming that susceptible and infected
dogs were well mixed at different scales,
ranging from the whole district to within
0.25-km² grid cells (fig. S2). The best-fitting
parameters at each scale differed in their simulation
outcomes. Only models at the 1-km²
scale reliably reproduced observed dynamics
and captured emergent population- and
individual-level properties (Figs. 1F and 3A
and fig. S10). We conclude that the processes 
that regulate the size of outbreaks and overall 
prevalence of rabies operate primarily on scales 
that are much smaller than those typically 
modeled for this disease. 
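
For readers unfamiliar with functional responses, a saturating contact curve of the kind borrowed from predator-prey theory can be sketched as follows. The Holling type II form is used here as an assumption for illustration (the excerpt does not state which form was fitted), and the parameter values are arbitrary placeholders rather than the fitted ones.

```python
def holling_type2(density, attack_rate, handling_time):
    """Holling type II functional response.

    Contacts rise almost linearly with density when density is low and
    saturate toward 1 / handling_time when density is high.
    """
    return attack_rate * density / (1.0 + attack_rate * handling_time * density)


# Placeholder parameters, chosen only to show the shape of the curve.
A, H = 0.5, 0.3
for dogs_per_km2 in (1, 10, 100, 1000):
    print(dogs_per_km2, round(holling_type2(dogs_per_km2, A, H), 3))
```

With such a curve, removing a few susceptible dogs matters most where the curve is steep, that is, at low local densities, while at high densities the contact rate is nearly independent of density.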

Our modeling shows that epidemic growth is 
curtailed in two ways: from deaths of rabid dogs 
reducing contact opportunities and through 
redundant exposures of dogs already incubat- 
ing infection. These processes increasingly stem 
transmission in lower-density areas, where 
dogs have fewer contact opportunities (Fig. 4). 
Simulations of index infections to estimate R0
(in an entirely susceptible population) show
that rabid dogs bite, on average, 2.91 dogs,
leading to approximately 1.47 secondary cases
per index case [95% percentile intervals (PI)
1.39 to 1.56]. This R0 value is slightly higher
than previously estimated (5), in part because
of population growth (median dog density
increased from 12 to >20 dogs/km² over the
14 years) but varies across the landscape in 
relation to dog densities (Fig. 4) and accord- 
ing to how it is measured; estimates simu- 
lated from dogs sampled from the landscape 
(by grid cell, rather than in proportion to 
density) were slightly lower (1.35, 95% PI 1.27 
to 1.43), whereas estimates from dogs sampled 
from the transmission network were slight- 
ly higher (1.48, 95% PI 1.38 to 1.58) because 
more cases occur in higher-density grid cells 
(Fig. 4) (9). 
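
The relation between bites and secondary cases reported above can be checked with simple arithmetic. The sketch below takes the two headline numbers from the text (2.91 dogs bitten and 1.47 secondary cases per index case) and assumes, as a simplification, that secondary cases equal bites multiplied by a single average probability of developing rabies; the function names are illustrative.

```python
MEAN_BITES_PER_RABID_DOG = 2.91  # from the text
R0_ESTIMATE = 1.47               # from the text


def implied_infection_probability(r0, mean_bites):
    """Average probability that a bitten dog becomes a secondary case."""
    return r0 / mean_bites


def bites_needed_for_r_of_one(p_infection):
    """Number of bites per rabid dog at which each case just replaces itself."""
    return 1.0 / p_infection


p = implied_infection_probability(R0_ESTIMATE, MEAN_BITES_PER_RABID_DOG)
print(round(p, 3), round(bites_needed_for_r_of_one(p), 2))  # 0.505 1.98
```

Under this simplification, roughly half of bitten dogs go on to develop rabies, so about two contacts per infection correspond to a reproduction number of 1.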

Under endemic circulation, the effective re- 
production number, R, declines by just over 
30% and remains near 1. Around 78% of this 
reduction is from recent rabies deaths remov- 
ing potential contacts, and 22% is from re- 
exposures of already incubating dogs. Thus, 
infected dog movement (fig. S1) determines 
the scale of mechanisms that regulate endem- 
ic dynamics, so that even small outbreaks 
(approximately five cases per square kilome- 
ter) can result in substantive reductions in 
R, given the heterogeneous distribution of
dogs on the landscape (Fig. 4). There remains
ambiguity in the mechanisms that underpin
density-independent transmission at higher
dog densities. Human responses likely play a
role (45% of traced rabid dogs were either
killed or tied) and are to some extent captured
in our model, but these may operate
differently during larger outbreaks (beyond
those observed) and in more urbanized populations
(<2% of dogs in this rural district
live at densities >100 dogs/km²). Our data
highlight how better understanding of functional
responses that describe theoretical
relationships of transmission with density
(10, 11) is needed to predict endemic pathogen
dynamics.

Fig. 4. Rabies transmission in relation to population density. (A) Distribution of
dog densities in Serengeti district on a log scale. (B) Holling curves computed from
the most reliable parameter set at each scale (9), with contact rate rescaled by
the median infectious period (2 days) and dog density on a log scale. Gray shading
indicates dog densities below the median (16.4 dogs per km²) midway through
the period (median density increased from 12 to 22 dogs/km² over the 14 years).
(Inset) Replotted on a linear scale. At the optimal spatial scale (red line; 1 km²), contact
rates are density dependent at low dog densities and become increasingly density
independent at higher densities. Parameter sets at other scales failed to reliably
generate the observed dynamics. The Holling curve indicates how even removal of
a small number of susceptible dogs locally (from rabies deaths or incubating infection)
reduces transmission so that R approaches 1, which corresponds to around two
contacts per infection (horizontal line), with half developing rabies. (C) Histogram of
R0 estimates from simulating index infections (9), either by location (blue), density
(orange), or from the transmission network (red). (D and E) Mapped (D) R0 estimates
and (E) dog density at the midpoint.

Individual variation in disease transmission
causes rare but more explosive outbreaks and
more frequent extinctions (13). We observed
considerable variation in rabid dog behavior
(Fig. 1D), with a few rabid dogs biting many
others (4% of rabid dogs bit >10 other dogs
each, and four bit >50 dogs) and running long
distances (nine rabid dogs ran >10 km to contact
other animals). Overdispersion in the size
of transmission chains (fig. S11) reflects this
variability in rabid dog behavior and thus
makes persistence more remarkable (13). Our
modeling captured this individual heteroge- 
neity (Fig. 1F), revealing its relevance for rabies 
dynamics: In counterfactual simulations with- 
out individual heterogeneity, rabies incidence 
was reduced by around 50% (fig. S7C) and the 
(relatively) large outbreaks observed in nature 
and in simulations with heterogeneity did 
not occur (Fig. 3, C versus A). The biological 
drivers of this variation result predominantly 
from individual-level (13) rather than environmental
or population-level differences (14, 15),
such as host density, but are poorly understood.
Behavioral manifestations of infection
that depend on sociological and pathological 
factors, such as exposure dose and sites of 
viral proliferation (16), might underpin this 
individual-level variation. 
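
The general point that individual variation produces more frequent extinctions alongside rarer, more explosive outbreaks can be illustrated with a toy branching process. This is not the authors' spatial individual-based model: it ignores space, density, and vaccination, and simply compares Poisson offspring numbers with overdispersed (negative binomial) ones at the same mean. The mean of 1.47 is taken from the text; the dispersion parameter k = 0.2, the cap of 500 cases, and all function names are assumptions for illustration.

```python
import math
import random


def poisson(rng, lam):
    """Draw a Poisson variate (Knuth's method; adequate for small means)."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def offspring(rng, mean_r, k=None):
    """Secondary cases from one case: Poisson if k is None, else negative binomial."""
    if k is None:
        return poisson(rng, mean_r)
    # A gamma-Poisson mixture gives a negative binomial with mean mean_r
    # and dispersion k (smaller k means more individual variation).
    return poisson(rng, rng.gammavariate(k, mean_r / k))


def chain_size(rng, mean_r, k=None, cap=500):
    """Total cases in one chain, stopping once `cap` cases are reached."""
    active, total = 1, 1
    while active and total < cap:
        active = sum(offspring(rng, mean_r, k) for _ in range(active))
        total += active
    return total


def extinction_fraction(mean_r, k=None, runs=2000, cap=500, seed=1):
    """Share of chains that die out before reaching `cap` cases."""
    rng = random.Random(seed)
    return sum(chain_size(rng, mean_r, k, cap) < cap for _ in range(runs)) / runs


if __name__ == "__main__":
    print("Poisson:", extinction_fraction(1.47))
    print("Overdispersed (k = 0.2):", extinction_fraction(1.47, k=0.2))
```

In this toy setting most overdispersed chains die out quickly, which is why persistence in the presence of strong individual heterogeneity is notable.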

Both the scale of and heterogeneity in con- 
tact and movement are crucial to capturing 
rabies dynamics (Fig. 3). Density-dependent 
transmission processes, although well described 
theoretically (2, 17), are extremely challenging 
to quantify in relation to spatial scale. This 
may explain why few empirical studies of di- 
rectly transmitted diseases have found evidence 
of strong density-dependent transmission 
(18-20). For rabies, the main susceptible de- 
pletion mechanisms—deaths of rabid dogs 
and reexposures of dogs already incubating 
infection—only curtail transmission if infec- 
tion is clustered and explain why nonspatial 
models of rabies do so poorly at replicating 
dynamics, because rabies incidence is negli- 
gible at the population-level. Clustering has 
been shown to reduce pathogen transmission 
through build-up of immune individuals (13),
as well as in the context of redundant biting by
insect vectors of disease (21); it has also been
theorized to reduce transmission in the early 
stages of epidemics (22). For rabies, the in- 
cubation period acts in a way similar to that 
of immunity, resulting in redundant expo- 
sures that limit transmission. Natural immu- 
nity is not generally considered relevant in 
canine rabies or required to explain persist- 
ence, but antibodies have been detected in 
healthy unvaccinated dogs (17). If short-lived 
immunity does follow aborted infections, as 
may be the case for vampire bat rabies (78), 
our expectation is that it would cluster in the 
same way as incubating infections do, rein- 
forcing local-scale effects. Clustering has been 
demonstrated to make outbreaks less explo- 
sive and to extend persistence (19), yet the 
potential relevance of such microdepletion 
mechanisms to many pathogens may be under- 
estimated because their measurement relies 
on sufficiently resolved datasets. Our conclusion, 
that the relevant spatial scale at which to con- 
sider host density is determined by the scale of 
movements of infectious hosts, offers a start- 
ing point for the appropriate spatial scale at 
which to model other pathogens for which 
spatially detailed data are lacking. 

Our analyses further illustrate the degree to 
which introduced cases contribute to rabies 
persistence. In the absence of introductions 
and under observed levels of vaccination, we 
expect infection to circulate in the Serengeti 
district for up to 7 years, typically dying out 
within 4 years (Fig. 3D). However, with between 8 and
24 rabid dogs arriving each year from neighboring
villages, infection persists even under reasonable
vaccination coverage, although
most introductions cause only short-lived chains of transmission
(fig. S10). In settings where vaccination
coverage is negligible (dog populations
across much of Africa and Asia), our simula- 
tions indicated a mean duration of outbreaks 
from single introductions of between 10 and 
30 weeks; however, the maximum exceeded 
12 years (fig. S10). Locally self-limiting clusters 
of cases recur on the landscape (movie S1) and, 
in combination with heterogeneous movement 
and contact, permit the invasion and cocirculation
of multiple lineages (Fig. 2) (20). Recurrent
introductions and extinctions have been
reported in many endemic settings (21-23),
and cross-border introductions have led to 
rabies emergence in several previously rabies- 
free areas (24-28). In contrast to diseases such 
as mosquito-transmitted dengue (29), chains
of infection circulate largely independently, 
given the low prevalence of cases and very lo- 
calized susceptible depletion. The concurrent 
extinction of all lineages therefore becomes less 
probable as more chains of infection cocirculate. 
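
The last point can be illustrated with a short calculation. The following is a minimal sketch, assuming that each chain of infection goes extinct independently within a given time window and with the same probability; the function name `prob_all_extinct` and the value `p = 0.9` are hypothetical and are not estimates from the study.

```python
def prob_all_extinct(p_extinct: float, n_chains: int) -> float:
    """Probability that n independent chains of infection are all extinct together.

    Assumes every chain has the same extinction probability and that chains
    do not influence one another (an idealization of localized depletion).
    """
    if not 0.0 <= p_extinct <= 1.0:
        raise ValueError("p_extinct must lie in [0, 1]")
    if n_chains < 0:
        raise ValueError("n_chains must be non-negative")
    return p_extinct ** n_chains


# Hypothetical per-chain extinction probability, for illustration only
p = 0.9
for n in (1, 5, 10, 20):
    print(n, round(prob_all_extinct(p, n), 3))
```

Under these assumptions the probability of concurrent extinction falls geometrically with the number of cocirculating chains, which is the qualitative behavior described above.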

From a practical perspective, our findings 
explain why dog culling has typically been so 
ineffective for controlling rabies; dog popula- 
tions would need to be reduced below very low 
densities across all areas where infection is 
circulating. Culling more than 50% of the 
400,000 dogs in Flores, Indonesia, had no ap- 
parent impact on rabies circulation (30). Our 
results reinforce the message that mass dog 
vaccination remains the most effective and 
feasible method of controlling rabies and provide
insights that should inform elimination
strategies. Simulations indicate that although 
dog vaccinations prevented more than 4000 
animal cases, 2000 human rabies exposures, 
and 50 deaths in the Serengeti district, intro- 
ductions continually seeded new foci, with 
scaled-up dog vaccination (beyond the district) 
required to achieve elimination (Fig. 3). Infre- 
quent longer-distance movements of rabid dogs 
seed outbreaks in unaffected localities across 
heterogeneously distributed populations, lead- 
ing to localized flare-ups where vaccination 
coverage is not maintained. The resulting
low-prevalence persistence presents a challenge
for elimination, given that surveillance is very 
weak in most rabies-endemic regions. Yet the 
concurrent circulation of viral lineages offers 
an opportunity for using increasingly afford- 
able genomic approaches to assess the per- 
formance of both rabies surveillance and 
control (31). Differentiating undetected circu- 
lation from reintroductions will be necessary 
as control efforts are scaled up toward the 
2030 goal of zero human deaths from dog- 
mediated rabies (32). 
Dynamics of particle network in composite
battery cathodes 


Jizhou Li, Nikhil Sharma, Zhisen Jiang, Yang Yang, Federico Monaco, Zhengrui Xu,
Dong Hou, Daniel Ratner, Piero Pianetta, Peter Cloetens, Feng Lin, Kejie Zhao, Yijin Liu


Improving composite battery electrodes requires delicate control of active materials and electrode
formulation. The electrochemically active particles fulfill their role as energy exchange reservoirs through
interacting with the surrounding conductive network. We formulate a network evolution model to
interpret the regulation and equilibration between electrochemical activity and mechanical damage
of these particles. Through statistical analysis of thousands of particles using x-ray phase contrast
holotomography in a LiNi0.8Mn0.1Co0.1O2-based cathode, we found that the local network heterogeneity
results in asynchronous activities in the early cycles, and subsequently the particle assemblies move
toward a synchronous behavior. Our study pinpoints the chemomechanical behavior of individual
particles and enables better designs of the conductive network to optimize the utility of all the
particles during operation.


Lithium-ion batteries (LIBs), with a high
energy density and long lifetime, have
been widely adopted for a broad range of
applications. The composite cathode of
LIBs is made of many electrochemically
active particles embedded in a conductive car- 
bon and binder matrix. The microstructure 
plays a crucial role in governing the LIB per- 
formance through modulating the electronic 
and ionic transport properties (1, 2) and the
chemomechanical behavior (3-5). The crack- 
ing, disintegration, and (de)activation behav- 
iors of the electrochemically active cathode 
particles affect the capacity fade over prolonged 
battery cycling (6-8). 

Alleviation of the active particle damage has 
focused on understanding and tuning the mor- 
phological and chemical characteristics at the 
microscale (9-12), such as particle size, elongation
and sphericity (13, 14), crystallographic
arrangement (15, 16), mesoscale kinetics (17, 18),
grain boundary properties (19, 20), and compositional
variation (21-24). For instance, reducing
the primary particle size is an effective
approach to improving the fast-charging per- 
formance because smaller particles have shorter 
ion diffusion paths (25, 26). Designing particles 
with elongated morphology, e.g., in the form of 
nanoplates or nanorods, can also improve the 
specific capacity and reduce charge transfer 
resistance (27). However, the correlation between
the particle morphology and the cell
performance is rather complex, with effects at
multiple length and time scales. The dynamics
of the particle network have substantial impacts
but are rarely studied. For example, recent 
studies have uncovered the local heterogeneity 
in the electrode, where active particles con- 
tribute to the cell-level chemistry differently in 
time and position (28). Some particles release 
lithium ions at a faster rate than their peers 
under fast-charging conditions (29). Some local 
regions could become inactive while the cell 
can still function well as a whole. To make a 
substantial improvement effectively, the particle
structure and the electrode morphology
should be tailored coherently, and a synergy 
could be achieved by doing so. A global homog- 
enization will eventually develop after long-term 
cycling; however, a poorly designed electrode 
would reach this state when most of its 
particles are severely damaged. By contrast, a 
well-formed electrode would rapidly converge 
to the electrode synchronization with most 
of its particles still intact. In our study, we 
image thick Ni-rich composite cathode electrodes
with a multilayer of LiNi0.8Mn0.1Co0.1O2
(NMC) particles at different states using nano- 
resolution hard x-ray phase contrast holoto- 
mography (Fig. 1A). These electrodes are 
recovered from standard coin cells that were 
cycled under fast-charging conditions for 
10 cycles and 50 cycles (fig. S1), respectively. With 
high spatial resolution and contrast, and a large 
field of view, our three-dimensional imaging 
data cover a large number of active particles 
that demonstrate a wide variety of damage 
patterns. To facilitate a statistical analysis, we 
build on our previous neural network-based 
particle identification method (7) and improve 
its accuracy and efficiency by developing a 
diagonal data fusion approach, which is illus- 
trated in figs. S2 and S3. Upon completion of 
the particle identification using this method, 
the damage level of individual particles is fur- 
ther quantified. The relative probability dis- 
tribution of the particle damage degree is 
presented in Fig. 1B, and a few randomly 
Fig. 1. Imaging cathode electrodes with a multilayer of NMC particles using nano-holotomography.
(A) Visualization of the composite battery cathode obtained by synchrotron nano-holotomography. Each NMC
particle has its own properties in position, particle structure, mesoscale chemistry, and local morphology.
(B) Probability distribution of the particle damage level. (C to F) Randomly selected examples of NMC
particles with different levels of damage.
Fig. 2. Heterogeneous particle damage in battery electrodes. The spatial distributions of the severely
damaged particles in the (A) 10-cycled and (B) 50-cycled electrodes. The degree of particle damage 
is color coded. Selected representative regions are enlarged for better visualization. The distances 
between the central damaged particle and its three nearest-neighboring damaged particles are 
annotated in the enlarged view. (C) Probability distributions of the distance between two neighboring 
severely damaged particles in 10-cycled and 50-cycled electrodes. 


selected particles with different damage pat- 
terns are highlighted in Fig. 1, C to F. 

The severely damaged particles are those 
overused during the electrochemical fast- 
charging process. Their spatial distribution 
and arrangement are evidence of the spatially 
heterogeneous electrode utility. As shown in 
Fig. 2, A and B, the severely damaged particles 
are sparsely distributed in the 10-cycled electrode,
and their concentration increases upon
further cycling, featuring a denser agglomera- 
tion in the 50-cycled electrode (as illustrated 
in the enlarged views). Figure 2C shows the 
probability distributions of the distance be- 
tween two neighboring severely damaged par- 
ticles in 10-cycled and 50-cycled electrodes, 
respectively. A shift toward shorter distance 
can be observed in the 50-cycled electrode, 
indicating a synchronization effect within 
the local particle clusters. 
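
The nearest-neighbor statistic behind Fig. 2C can be sketched as follows. This is a minimal illustration and not the authors' analysis code: the function names and the particle centroids are hypothetical, and a brute-force search is used for clarity rather than speed.

```python
import math


def nearest_neighbor_distances(points, k=1):
    """For each 3D point, return the distances to its k nearest other points."""
    if len(points) <= k:
        raise ValueError("need more than k points")
    result = []
    for i, p in enumerate(points):
        # Distances from point i to every other point, sorted ascending
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        result.append(dists[:k])
    return result


def mean_nearest_distance(points):
    """Mean distance from each point to its single nearest neighbor."""
    nn = nearest_neighbor_distances(points, k=1)
    return sum(d[0] for d in nn) / len(nn)


# Hypothetical centroids (in micrometers) of severely damaged particles
sparse = [(0, 0, 0), (30, 0, 0), (0, 30, 0), (0, 0, 30)]
dense = [(0, 0, 0), (10, 0, 0), (0, 10, 0), (0, 0, 10)]
print(mean_nearest_distance(sparse), mean_nearest_distance(dense))
```

A histogram of these distances for each electrode would give distributions of the kind compared in Fig. 2C; a shift toward shorter distances corresponds to denser local agglomeration of damaged particles.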

We perform theoretical modeling to under- 
stand the damage and Li reaction behaviors of 
NMC particles across cycles. We envision that 
the interplay between the electrochemical ac- 
tivity and the mechanical damage regulates 
the performance of the NMC particles. The 
deeper state of charge incurs more severe con- 
sequences, such as the particle damage and 
decohesion of particles from the conductive 
agent. By contrast, the mechanical damage 
increases the cost of Li reactions and sup- 
presses the electrochemical activity of individ- 
ual particles. A more synchronous behavior of 
the composite cathode is achieved in the 
prolonged cycles because of the equilibrium 
between the electrochemical activity and me- 
chanical damage. To test this hypothesis, we 
conducted finite element analysis to model 
the electrochemical response and mechanical 
damage of an NMC cathode composed of three
spherical NMC active particles surrounded by
two homogeneous porous carbon/binder (CB)
domains of different electrical conductivities 
(Fig. 3). The intention of the computational 
model is not to capture all the explicit micro- 
structural details in the composite cathode. 
Rather, our goal is to replicate the salient fea- 
ture in the composite such that the active NMC 
particles are covered by different degrees of 
the electrically conductive agent, which results 
in various local conducting networks for 
individual particles. In this simplified model, 
the surrounding high-conductivity and low- 
conductivity CBs differ in their electrical con- 
ductivities. The model assumes that liquid 
electrolyte is soaked in the porous CB domains 
and thus the NMC particles are fully accessible 
to the Li+ in the liquid electrolyte. We set different
ratios of coverage of CB on the periphery
of each active particle, as illustrated in Fig. 3A.
The interface of active particles attached to the
high-conductivity CB undergoes a faster electrochemical
reaction than the boundary enclosed
by the low-conductivity CB. Thus, each
active particle experiences dissimilar electrochemical
activities, as inferred from the diverging
concentration profiles (C/C_max) during the
first charging process (Fig. 3B). 

The modulation between the electrochem- 
ical activity and mechanical damage reduces 
the variation of Li concentration with the 
progression of (dis)charge cycles. As shown in 
the normalized Li concentration plot (Fig. 3B) 
and the plot of the Li concentration variation 
across the three NMC particles (Fig. 3C), the 
concentration profiles converge with battery 
operation. During the charging process, Li ex- 
traction generates a reduction of the lattice 
volume in the NMC particles (30). Consider- 
able variation in the mechanical properties of 
the NMC particles (elastic modulus ~ 140 GPa) 
and CB (elastic modulus ~ 2 GPa) generates
strain mismatch at the interface. The apex of
the mismatch occurs near the end of the 
charging process, as demonstrated by the 
divergence of the damage profiles in Fig. 3D 
and their corresponding differences to the 
mean damage in Fig. 3E. 

After the onset of heterogeneous damage 
among the NMC particles, the individual dam- 
age curves diverge (Fig. 3D). The individual 
NMC particle characteristic—i.e., the dissimilar 
coverage by high- and low-conductivity CBs— 
commands the degree of heterogeneous damage 
in the early cycles. With successive discharging 
and charging processes, the modulation be- 
tween electrochemical activity and mechanical 
damage reduces the imbalance within the 
system (through the interfacial resistance for 
charge transfer). Consequently, the damage 
level for all three particles converges, demon- 
strating the system’s progression toward a 
synchronized behavior. In addition, we ob- 
serve a similar transformation to synchron- 
ized damage behavior for the system with 
more NMC active particles (fig. S4). The variation
in the periphery contact with high- and
low-conductivity CB regions generates hetero- 
geneous reactions for each particle. After the 
initial divergence, the individual particle dam- 
age is tuned by the feedback to electrochemical 
activity that progresses toward a synchronized 
behavior in the composite electrode. 

Both the particle damage and Li concen- 
tration profiles theoretically confirm the 
asynchronous to synchronous evolution in 
composite cathodes. Such a transition can 
occur for a number of reasons, such as from 
the particles’ self-attributes, interactions with 
neighboring particles, and CB domains. To 
probe the evolution mechanisms from the 
intrinsic or internal physical nature of cathode 
particles, we analyzed the three-dimensional 
tomographic imaging data through an inter- 
pretable machine learning framework. 

Using more than 2000 accurately iden- 
tified NMC particles, we extract their structural, 
chemical, and morphological characteristics. 
More specifically, we divided the particle 
attributes into four different groups: position, 
chemical properties, particle structure, and 
local morphology (as illustrated in table S1 and 
fig. S5; in total, 24 attributes are extracted). As 
indicated in fig. S6, these extracted particle 
attributes demonstrate varying characteristics 
in their respective probability distributions. 
Revealing their interrelationship is not straight- 
forward and could benefit from more advanced 
computing and modeling approaches. 

We leverage the advances in machine learning 
to model relationships and dependencies among
attributes, i.e., descriptors of the cathode particles’
properties (fig. S7). The model has to be
both accurate and interpretable. To elucidate 
the intertwined limiting factors for battery 
cathode robustness, we explore the degree of 
Fig. 3. Finite element analysis of the electrochemical activity and mechanical
damage in the NMC cathode. (A) Illustration of the composite model during
the charging process in the battery. (B) Normalized Li concentration profiles
depict the inherent heterogeneity of the system during the first charging
process with respect to the normalized time t/τ, where t is the real time in Li
reactions and τ = 720 s is the theoretical time to reach the full capacity of NMC.
Although the particles start with the same state of charge, Li concentration


cathode NMC particles’ engagement in the 
cell-level chemistry through attribute correla- 
tion and damage regression. These two steps 
are accomplished using a regularized autoencoder
neural network (31) and random forest
(RF) regression (32). The SHAP (Shapley ad- 
ditive explanations) (33) is utilized to rank the 
importance of the particle properties to the 
degree of particle damage during the process 
of regression, which effectively reveals the con- 
tributions of different microstructural charac- 
teristics to the damage profile for every particle 
in our electrode. The circular plot (34) is used 
for better visualization of the Pearson’s correlation
among different particle attributes. Integrated
with the SHAP values, the RF provides
not only accurate regression results, but also 
the interpretability of the impacts of all the 
input attributes on individual predictions as 
well as global insights. 

Specifically, the autoencoder neural network 
compresses the input attributes into latent dimensions
(LDs) through an encoder network
(fig. S8), which has been extensively applied 
for scientific discovery because of its ability to 
learn nonlinear functions and its good inter- 
pretability (35, 36). The LDs of both datasets 
for 10-cycled and 50-cycled electrodes, respec- 
tively, are calculated and are subsequently 
correlated with each other through Pearson’s 
correlation. As shown in fig. S9, different LDs 
show intertwined relationships in both cases.
Each node in the circular plot represents one 
LD, and a connection between two nodes indi- 
cates a relatively high correlation between 
them. The sign of the correlation coefficient 
(+/-) defines the direction of the relationship. 
For the 10-cycled electrode, the first five LDs 
appear to be independent. As more LDs are 
added, we start to observe correlations among 
them. For the 50-cycled electrode, in addition 
to the common connections, several addi- 
tional correlations emerge (as highlighted by 
the dark colors in fig. S9D). The observation of 
a higher degree of interdependence among 
different LDs indicates that the particles’ struc- 
tural and chemical characteristics become more 
intertwined upon battery cycling. 
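
The correlation step described above can be sketched as follows. This is a minimal illustration, assuming the latent dimensions are already available as lists of values (one list per LD); the 0.5 threshold, the function names, and the sample values are hypothetical and are not taken from the study.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


def correlated_pairs(columns, threshold=0.5):
    """Return (i, j, r) for every column pair with |r| >= threshold.

    Each returned triple corresponds to one connection in a circular plot;
    the sign of r gives the direction of the relationship.
    """
    edges = []
    for i in range(len(columns)):
        for j in range(i + 1, len(columns)):
            r = pearson(columns[i], columns[j])
            if abs(r) >= threshold:
                edges.append((i, j, r))
    return edges


# Hypothetical latent-dimension values for five particles (one list per LD)
lds = [
    [0.1, 0.4, 0.5, 0.9, 1.2],
    [0.2, 0.8, 1.1, 1.7, 2.5],    # roughly tracks LD 0
    [1.0, -1.0, 1.0, -1.0, 0.0],  # only weakly related to the others
]
for i, j, r in correlated_pairs(lds):
    print(f"LD{i} - LD{j}: r = {r:+.2f}")
```

Counting the connections returned for each electrode gives a simple measure of how interdependent the LDs are.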

When interpreting the model regression re- 
sults with the SHAP values, which utilize the 
game-theory-based Shapley values (37), the 
contribution of each attribute to the model’s 
output (the particle damage degree) can be ob- 
tained. Attributes with larger SHAP values 
are considered to be more important to the 
target damage degree. The contribution scores 
of all attributes to the particle damage in both 
10-cycled and 50-cycled electrodes are pre- 
sented in Fig. 4 (attributes are grouped on the 
basis of their properties and reordered for 
better visualization), and the interpretation is 
provided below. The advantage of using SHAP 
differs at the end of the first charge process. (C) The variation of Li concentration
profiles among three NMC particles. The overall trend demonstrates the
tendency toward a synchronized behavior. (D) The damage profiles for
three NMC active particles diverge near the end of the first charge process.
With the progression of the cycling process, the damage profiles for all
three particles converge. (E) Each particle's deviation from the mean damage
profile (the black dashed line).


to explain the regression model is its super- 
ior robustness to correlated attributes com- 
pared to the traditional methods (38), e.g., 
the Pearson’s correlation, which cannot sys- 
tematically capture the key differences in the 
studied electrodes (fig. S11). 
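
The Shapley values that underlie SHAP can be computed exactly for a small number of attributes. The following is a minimal sketch of that definition, not the SHAP library's algorithm and not the RF model used in the study: `toy_damage`, its coefficients, and the all-zero baseline are hypothetical choices made for illustration.

```python
from itertools import combinations
from math import factorial


def shapley_values(model, x, baseline):
    """Exact Shapley attribution of model(x) - model(baseline) to each feature.

    Features outside a coalition are set to their baseline value.
    The cost grows as 2**n, so this is only practical for a few features.
    """
    n = len(x)

    def value(coalition):
        mixed = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return model(mixed)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size that exclude feature i
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = set(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


# Hypothetical toy "damage" model of three attributes (not the paper's model)
def toy_damage(a):
    depth, volume, elongation = a
    return 0.5 * depth + 0.2 * volume + 0.1 * volume * elongation


x = [1.0, 2.0, 3.0]
baseline = [0.0, 0.0, 0.0]
phi = shapley_values(toy_damage, x, baseline)
print([round(v, 3) for v in phi])
```

In this toy model the interaction term is shared equally between the two attributes involved, and the attributions sum to the difference between the prediction and the baseline prediction, which is the property that makes the contribution scores comparable across attributes.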

From our model-based prediction in Fig. 4, 
some of the attributes follow expected trends. 
For example, the particle’s depth, Z, affects the 
particle damage (28, 39). This can be related 
to the cell polarization effect, which results 
in particles at different depths experiencing 
different states of charge at a given time. 
The Z-dependence of particle damage is more 
pronounced in the 10-cycled electrode, in 
good agreement with previous reports (9, 28). 
The electron density, EDensity, has been 
associated with the state of charge (7), and
its averaged value and degree of variation,
Homogeneity, show considerable impact
on the particle damage throughout the cycl- 
ing process. The surface area and rough- 
ness (RoughOuter, RoughInner, SurfOuter, and 
SurfInner; table S1) could affect the cohesion 
of the active particles and the CB matrix. 
Therefore, the surface characteristics could 
affect the particle damage. The particle’s size, 
Volume, appears to be correlated with the par- 
ticle damage. Its contribution score seems 
slightly lower in the 50-cycled electrode. This 
trend suggests that the particle-size effect might 
Fig. 4. Interpretable machine learning framework for particle attributes modeling. The contribution
scores of all attributes to the particle damage in 10-cycled (green) and 50-cycled (blue) electrodes. Triangle
and square markers represent results from two robustness validation approaches, data-subsampling and
random-seeding, respectively. The mean and SD of the differences in the contribution scores (N = 20)
between the 10-cycled and 50-cycled data are plotted


not be the limiting factor in the later cycles. 
However, the variation of the neighboring par- 
ticles’ volumes, VolumeStd, shows an opposite 
trend, featuring a higher contribution score 
in the 50-cycled electrode. This observation in- 
dicates that, upon prolonged battery operation, 
the uniformity in the neighboring particle size 
becomes a more relevant factor that affects the 
particle damage. Mixing particles of different 
sizes has been used as a method to improve 
the electrode’s packing density. Our result 
suggests that this approach should be carefully 
examined from the long-term cyclability per- 
spective. Another finding is that the particle 
elongation, Elongation, which represents the 
ratio of the longest axis length to the shortest 
axis length (table S1), has a rather high con- 
tribution score, which, however, decreases upon 
cycling. By contrast, the alignment of the neigh- 
boring particles, OrienIso, which shows negli- 
gible contribution in the 10-cycled electrode, 
becomes more important in the later cycles. In 
advanced battery electrode manufacturing, 
particle alignment can be purposely adjusted 
by controlling externally applied electric and/or 
magnetic fields (40-42). 

When visualizing the overall picture of our 
statistical analysis over thousands of particles, 
we find an interesting pattern: In the early 
cycles, individual particles’ characteristics 
(e.g., the position Z, VSratio, Sphericity, and 
Elongation; table S1) predominantly determine 
their respective degrees of damage, featuring an 
asynchronous behavior that is in agreement with 
our theoretical modeling result. In the later 
cycles, however, the interplay among neighboring
particles (e.g., Contact, DisNearest, OrienIso,
and PDensity) becomes more important, which
on the top.


indicates that the local interparticle arrangement 
can critically affect the asynchronous-to- 
synchronous transition. The mean difference 
and standard deviation (SD) of each attribute’s 
contribution scores between the 10-cycled and 
50-cycled datasets are presented at the top of
Fig. 4, featuring a valley on the left and a peak 
on the right, supporting the above-described 
observation. 

Our experimental observations (Fig. 2) and 
machine learning analysis (Fig. 4) collectively 
corroborate the theoretical modeling (Fig. 3). 
These results reveal a transition from the 
asynchronous behavior in the early stage 
toward a synchronous state later in the par- 
ticle network evolution, where the interplay 
among neighboring particles plays a facilitating
role (fig. S12). Particles’ self-attributes, together
with the dynamic nature of the conductive
network, jointly determine the damage behav- 
ior of NMC particles in composite electrodes. 
These are critical factors for cathode design to 
prolong the cycle life of batteries. On the basis 
of our results, in the active cathode powder, 
it is useful to suppress the particle-to-particle 
variation in their structural characteristics, 
such as particle size, sphericity, elongation, 
etc. At the electrode scale, an ordered par- 
ticle arrangement is favorable, which can be 
reinforced through a field-guided approach. 
Whereas the in-plane homogeneity is desir- 
able, in the out-of-plane direction, a structural 
gradient could be beneficial because of the 
electrochemical polarization, which is more severe 
in thick electrodes. To summarize, an ordered 
electrode configuration with tailored depth- 
dependent packing of uniform active particles 
would be robust to prolonged battery cycling. 


From the synthesis perspective, the particle 
shape and structure can be tuned by controlling
the sintering temperature, incorporating
trace-element dopants, designing the architecture
of the precursor, and applying surface coatings.
These are common synthesis strategies and 
can be scalable for mass production. For the 
electrode manufacturing, the field-guided ap- 
proach has been demonstrated to be effective 
for creating an ordered structure. This is com- 
patible with the existing electrode manufac- 
turing facilities and, thus, can be fairly cost 
effective. 
GRAVITATIONAL WAVES


A gamma-ray pulsar timing array constrains the 
nanohertz gravitational wave background 


The Fermi-LAT Collaboration*†


After large galaxies merge, their central supermassive black holes are expected to form binary 
systems. Their orbital motion should generate a gravitational wave background (GWB) at nanohertz 
frequencies. Searches for this background use pulsar timing arrays, which perform long-term monitoring 
of millisecond pulsars at radio wavelengths. We used 12.5 years of Fermi Large Area Telescope data 
to form a gamma-ray pulsar timing array. Results from 35 bright gamma-ray pulsars place a 95% 
credible limit on the GWB characteristic strain of $1.0 \times 10^{-14}$ at a frequency of 1 year$^{-1}$. The sensitivity
is expected to scale with $t_{\rm obs}$, the observing time span, as $t_{\rm obs}^{-13/6}$. This direct measurement provides
an independent probe of the GWB while offering a check on radio noise models. 


Pulsars are spinning neutron stars that
emit beams of broadband radiation
from radio to gamma-ray wavelengths
that appear to pulse as they periodically
sweep across the line of sight to
Earth (1). Millisecond pulsars (MSPs) spin at
hundreds of hertz and pulse with sufficient 
regularity to function as celestial clocks, dis- 
tributed across the sky and throughout the 
Galaxy. Timing of individual MSPs by use of 
radio telescopes has been used to test general 
relativity and alternative theories of gravity (2). 
Long-term monitoring campaigns of ensembles 
of MSPs are used to search for low-frequency 
gravitational waves (GWs), which are expected 
to be emitted by supermassive black hole 
(SMBH) binaries that are predicted to exist at 
the centers of galaxies that have undergone 
mergers. General relativity predicts that a circular
binary with orbital frequency $f/2$ will emit
GWs with frequency $f$ and amplitude $\propto f^{2/3}$ (3).
When SMBH binaries have an orbital sepa- 
ration of ~0.01 pc, which is equivalent to 
~2000 astronomical units, the orbits decay 
primarily through GW emission. Because of 
this link between GW frequency and ampli- 
tude, the superposition of GWs from many 
SMBH binaries throughout the Universe is 
predicted to build up a GW background (GWB) 
with a characteristic GW strain $h_c$ following


*Fermi-LAT Collaboration authors and affiliations are listed in the
supplementary materials.
†Corresponding authors: Matthew Kerr (matthew.kerr@gmail.com);
Aditya Parthasarathy (adityapartha3112@gmail.com)
a power law in frequency (4)

$$h_c(f) = A_{\rm gwb} \left( \frac{f}{\mathrm{year}^{-1}} \right)^{\alpha} \qquad (1)$$

The spectral index $\alpha$ is predicted to be $-2/3$
for GW-driven binary inspirals, and the
dimensionless strain amplitude $A_{\rm gwb}$ incorporates
the growth, masses, and merger rates of
SMBHs. If SMBHs do not rapidly migrate to 
the centers of newly merged galaxies, there 
will be fewer wide binaries, reducing the GW 
power at low frequencies. Thus, the measured 
GWB is expected to carry information about 
the distribution of SMBH masses and the dy- 
namical evolution of SMBH binary systems (5). 

Searches for the GWB can be performed 
with ensembles of MSPs—known as pulsar 
timing arrays (PTAs) (6, 7)—by monitoring the 
times of arrival (TOAs) of the steady pulses 
from each pulsar, which arrive earlier or later 
than expected owing to the spacetime pertur- 
bations. Because the GWB is expected to be a 
sum of many individual sources, the induced 
TOA variations are random and differ for each 
pulsar but have a common spectrum of power 
spectral densities, $P(f)$

$$P(f) = \frac{A_{\rm gwb}^2}{12\pi^2} \left( \frac{f}{\mathrm{year}^{-1}} \right)^{-\Gamma} \mathrm{year}^3 \qquad (2)$$

with spectral index $\Gamma = 3 - 2\alpha = 13/3$ for
SMBHs (4). This functional form has more
power at low frequencies so is referred to
as a red spectrum. For observations taken at
an approximately fixed location (Earth), the
GWB is expected to produce a signature quadrupolar
pattern of TOA variations, known as
the Hellings-Downs correlation (8).
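
Equations 1 and 2 can be evaluated numerically as in the following minimal sketch. The function names are hypothetical, frequencies are expressed in cycles per year so that the reference frequency is 1, and the amplitude `a_gwb = 2e-15` is an illustrative assumption rather than a measured value.

```python
import math


def characteristic_strain(f_per_year, a_gwb, alpha=-2.0 / 3.0):
    """Eq. 1: h_c(f) = A_gwb * (f / year^-1)^alpha, with f in cycles per year."""
    return a_gwb * f_per_year ** alpha


def timing_spectral_index(alpha=-2.0 / 3.0):
    """Gamma = 3 - 2*alpha, which equals 13/3 for GW-driven inspirals."""
    return 3.0 - 2.0 * alpha


def timing_power_spectral_density(f_per_year, a_gwb, alpha=-2.0 / 3.0):
    """Eq. 2: P(f) = A_gwb^2 / (12 pi^2) * (f / year^-1)^(-Gamma), in year^3."""
    gamma = timing_spectral_index(alpha)
    return a_gwb ** 2 / (12.0 * math.pi ** 2) * f_per_year ** (-gamma)


a_gwb = 2e-15  # illustrative amplitude only (assumption)
for f in (0.1, 0.5, 1.0):
    print(f, characteristic_strain(f, a_gwb), timing_power_spectral_density(f, a_gwb))
```

Because $\Gamma$ is positive, the power spectral density rises steeply toward low frequencies, which is what makes the spectrum red.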

Because the expected quadrupolar correla- 
tions are only about 10% of the total signal, 
the GWB is predicted to initially appear as a 
set of independent signals from each pulsar, 
with power spectra all consistent with Eq. 2. 
The quadrupolar distribution would only be- 
come evident in more sensitive observations. 
Radio PTAs have reported a red spectrum pro- 
cess with modest statistical significance (9-12), 
but no Hellings-Downs correlation has been 
found. These results could be compatible with
α = −2/3 and A_gwb ~ 2 × 10⁻¹⁵ to 3 × 10⁻¹⁵ at
1 year⁻¹ (Fig. 1). This would be consistent
with some predictions for the GWB (5), but 
because no spatial correlations have been 
detected, it could have other origins. 
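
The Hellings-Downs correlation is not written out in the text. The sketch below uses the closed form commonly quoted in the PTA literature for two distinct pulsars separated by an angle θ on the sky, C(θ) = (3/2) x ln x − x/4 + 1/2 with x = (1 − cos θ)/2. The normalization (1/2 at zero separation) is one common convention and is an assumption here, as is the function name.

```python
import math


def hellings_downs(theta_rad):
    """Expected GWB-induced correlation between two distinct pulsars.

    Uses C = 1.5 * x * ln(x) - x / 4 + 0.5 with x = (1 - cos(theta)) / 2.
    The limit x -> 0 is handled explicitly because x * ln(x) -> 0.
    """
    x = (1.0 - math.cos(theta_rad)) / 2.0
    if x <= 0.0:
        return 0.5
    return 1.5 * x * math.log(x) - x / 4.0 + 0.5


for degrees in (0, 45, 90, 135, 180):
    value = hellings_downs(math.radians(degrees))
    print(f"{degrees:3d} deg -> {value:+.4f}")
```

The curve changes sign with angular separation, which is the feature that distinguishes a GWB from noise processes that merely share a common spectrum.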

A potential alternative explanation for this 
signal is spin noise, which is approximately 
power-law red noise intrinsic to each pulsar, 
with some MSPs observed to have a spin noise 
spectral index (Γ) of 2 to 7 (13, 14). Possible
physical origins for spin noise include turbulence
in the neutron star interior (15) and
systematic variations in the magnetic field 
and corotating plasma, which govern the ro- 
tational energy loss of the pulsar (16). Pulsars 
that have spin noise spectra with similar shapes 
but different amplitudes—which is inconsistent 
with a GWB—could masquerade as a common 
mode signal without a Hellings-Downs correlation (10).

Another potential noise source for radio 
PTAs is the frequency-dependent effect of 
radio propagation through plasma, including 
the solar wind and the ionized interstellar 
medium (IISM). Pulsed radio emission at frequency
ν is delayed by time t_DM

$$t_{\rm DM} = 4.15\ {\rm ms} \times \left( \frac{\rm DM}{{\rm pc\ cm}^{-3}} \right) \left( \frac{\nu}{\rm GHz} \right)^{-2} \qquad (3)$$


where DM is the dispersion measure, equal to 
the total electron column density. The DM of 
a pulsar can vary with time because of the 
relative motions of Earth and the pulsar. Cor- 
recting for this effect requires repeated mea- 
surements by use of multifrequency radio 
observations and the introduction of many 
additional degrees of freedom to timing models. 
Because the propagation paths of radio waves 
Fig. 1. Constraints on the GW background from radio and gamma-ray PTAs. The inferred
constraints on the GWB amplitude at 1 year⁻¹ (A_gwb) are plotted as a function of publication date
(data sources are listed in table S7) and assume α = −2/3, as predicted for the superposition
of GWs from merging black holes. Colored symbols correspond to each of the PTAs indicated in
the key. Upper limits at 95% confidence are shown as downward arrows, and amplitude ranges
indicate detections of a common noise process, which could be the GWB or have other origins. The
Fermi-LAT 95% upper limit, 1.0 × 10⁻¹⁴, uses data obtained up to January 2021 and is plotted at a
publication date of April 2022. The dashed red line indicates the expected scaling of the Fermi-LAT
limit as a function of time.
Fig. 2. Comparison between A_gwb measurements from each pulsar by using three analysis methods.
Data points indicate the limits on an α = −2/3 GWB for 35 MSPs computed with three methods: Two TOA-based
codes, TEMPONEST (orange stars) and ENTERPRISE (gray circles), are shown as a function of the limit from a
photon-by-photon analysis (x axis). The dashed line indicates equality between the results of the TOA-based and
photon-by-photon methods. Six pulsars (purple triangles) have only a photon-based analysis so are plotted
arbitrarily at zero on the y axis. The three labeled pulsars are outliers (21).
through the IISM depend on ν, the DM itself is
frequency dependent (17, 18), so some of this
delay is intrinsically unmeasurable. Other prop- 
agation effects include a broadening of the 
pulse, which can only be corrected for bright 
pulsars, with some components also being un- 
measurable (19). Because the IISM is turbulent, 
these uncorrected delays introduce additional 
red noise to radio pulsar timing data. The var- 
iable solar wind introduces similar dispersive 
delays that can in principle be measured like 
DM variations but are only partially included 
in current models (20). Because of the wide 
angular extent of the solar wind, uncorrected 
delays would be correlated among pulsars. As 
with spin noise, IISM-induced noise with sim- 
ilar spectra could mimic a GWB signal. Pre- 
dicted noise amplitudes are similar to the 
expected GWB signal, but these predictions 
rely on assumptions about the turbulent spectra 
of the IISM, which are poorly constrained by 
data (19). Further discussion of the modeling 
and impact of noise is available in (21).
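
To give a sense of scale for the dispersive delay, the following Python sketch evaluates the cold-plasma relation t_DM = 4.15 ms × (DM / pc cm⁻³) × (ν / GHz)⁻², which is the assumed form of Eq. 3. The DM change of 10⁻⁴ pc cm⁻³ and the 1.4 GHz observing frequency are hypothetical example values, and extrapolating the same relation to giga-electron volt photons is only meant to show the order of magnitude.

```python
H_EV_S = 4.135667696e-15  # Planck constant in eV s


def dispersion_delay_ms(dm_pc_cm3, freq_ghz):
    """Assumed form of Eq. 3: delay in ms for DM in pc cm^-3 and frequency in GHz."""
    return 4.15 * dm_pc_cm3 / freq_ghz ** 2


def photon_energy_to_ghz(energy_ev):
    """Convert a photon energy in eV to a frequency in GHz via nu = E / h."""
    return energy_ev / H_EV_S / 1.0e9


# Hypothetical example: a DM change of 1e-4 pc cm^-3.
dm_change = 1.0e-4
radio_delay_ms = dispersion_delay_ms(dm_change, 1.4)
gamma_delay_ms = dispersion_delay_ms(dm_change, photon_energy_to_ghz(1.0e9))

print(f"1.4 GHz radio: {radio_delay_ms * 1.0e6:.1f} ns")
print(f"1 GeV photon:  {gamma_delay_ms:.1e} ms")
```

With these example numbers the radio delay is a few hundred nanoseconds, comparable to the timing precision being sought, whereas the same DM change is many orders of magnitude below any measurable delay at gamma-ray energies.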

Gamma-ray observations offer a potentially 
complementary approach: The much higher 
photon frequency means that the effects of 
the IISM and solar wind are negligible. The 
Large Area Telescope (LAT) (22), on the Fermi 
Gamma-ray Space Telescope, is sensitive to 
giga-electron volt gamma-ray photons emitted 
by MSPs. Its 2.4-sr field of view performs a 
continuous survey, covering the full sky every 
two orbits (~3 hours). Its GPS clock records 
photon arrival times with <300 ns precision 
(23), enabling pulsar timing. Analyses of LAT 
survey data have detected 127 of the more than 
400 known MSPs in the Milky Way (21, 24). 
The number of MSPs in this sample, long ob- 
serving span, and instrumental stability enable 
a gamma-ray PTA whose characterization of 
spin noise and a potential GWB signal is free 
from IISM effects. 

Using the 35 brightest and most stable 
gamma-ray MSPs and 12.5 years of Fermi- 
LAT data, we searched for the GWB using 
two different techniques (21). First, we implemented
a coherent photon-by-photon analysis
that retains <1 μs resolution. Second, for
analysis with established software used for 
radio PTAs, we directly measured TOAs from 
the LAT data (25). Because the TOA estimation 
procedure requires averaging up to 1 year of 
data, this method loses sensitivity to signals 
with shorter time scales, and only 29 of the 
35 pulsars are suitable. 

For each pulsar, we searched for spin noise 
and derived an upper limit on A_gwb using (i)
the photon-by-photon method and (ii) two 
TOA-based software packages, TEMPONEST 
(26) and ENTERPRISE (27). None of the 
pulsars show evidence for spin noise (21), and
the three different methods provide consistent
results for each pulsar (Fig. 2), except in
three cases (21).
Fig. 3. Gamma-ray constraints on different types of GWB sources. GWB amplitudes A_gwb
for assumed spectral indices α in the shaded region are excluded with 95% confidence. The symbols
indicate the values of α expected for SMBH binaries (red star; our fiducial result), GWs generated
during cosmic inflation (green triangle), and from hypothetical cosmic strings (blue circle).


Three of the pulsars in our sample have spin 
noise measurements from radio PTAs. Using 
the power spectral indices Γ measured from
the radio timing data, we calculated 95%
upper limits on spin noise amplitudes from
the gamma-ray data. Our limits are below the
previously measured values for PSR J0030+0451
(10% of the measured value) and PSR
J1939+2134 (60 to 70%) but are unconstrain- 
ing for PSR J0613-0200. This discrepancy 
might indicate contamination by residual 
IISM effects on the radio-based spin noise and 
GWB signal measurements. We combined the 
single pulsars into a PTA and estimated A_gwb
limits under a variety of scenarios, including 
marginalization over possible spin noise and 
uncertainties in the position of Earth relative 
to the Solar System barycenter, and both ex- 
cluding and including the expected Hellings- 
Downs quadrupolar spatial correlations (21). 
The resulting representative 95% confidence 
limit is A_gwb < 1.0 × 10⁻¹⁴ (Fig. 1), a factor of
3 to 5 greater than the red spectrum process 
detected by radio PTAs. 

For an idealized PTA, when a potential GWB 
signal is weak compared with other noise, the 
signal-to-noise ratio grows proportionally to
A_gwb² × t_obs^Γ (28, 29), where t_obs is the observing
time span and Γ = 13/3 for SMBHs, as in
Eq. 2. This means that upper limits on A_gwb improve
following the relation A_gwb ∝ t_obs^(−13/6).
However, if the signal detected by radio PTAs
does arise from the GWB, then these PTAs are
now in the strong signal regime, and their
sensitivity will improve more slowly (∝ t_obs^(−1/2)).
The differing time scalings and noise sources 
allow the gamma-ray PTA data to distinguish 
residual IISM variations from a potential 
GWB signal. 
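
The weak-signal scaling can be turned into a back-of-the-envelope projection. The sketch below assumes that the signal-to-noise ratio scales as A_gwb² × t_obs^Γ, so that an upper limit at fixed significance scales as t_obs^(−Γ/2), which is t_obs^(−13/6) for Γ = 13/3. It takes the 12.5-year limit of 1.0 × 10⁻¹⁴ quoted above as the starting point. The target amplitude of 2.5 × 10⁻¹⁵ is a hypothetical value inside the range reported by radio PTAs, and the projection ignores any departure from an idealized PTA.

```python
GAMMA_SMBH = 13.0 / 3.0  # timing spectral index for SMBH binaries


def projected_limit(a_now, t_now_yr, t_future_yr, gamma=GAMMA_SMBH):
    """Idealized weak-signal scaling: limit proportional to t_obs^(-gamma/2)."""
    return a_now * (t_future_yr / t_now_yr) ** (-gamma / 2.0)


def time_to_reach(a_now, t_now_yr, a_target, gamma=GAMMA_SMBH):
    """Observing span at which the idealized limit equals a_target."""
    return t_now_yr * (a_now / a_target) ** (2.0 / gamma)


a_now, t_now = 1.0e-14, 12.5   # limit and data span quoted in the text
a_target = 2.5e-15             # hypothetical target amplitude

t_needed = time_to_reach(a_now, t_now, a_target)
print(f"Idealized span needed: {t_needed:.1f} years")
print(f"Limit after 20 years:  {projected_limit(a_now, t_now, 20.0):.2e}")
```

Under these assumptions, a factor of 4 improvement requires less than a doubling of the observing span, because the limit falls steeply with time in the weak-signal regime.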

The Fermi PTA data have an essentially con- 
stant experimental setup; the data are almost 
uninterrupted, and calibrations have been con- 
stant for the full 12.5-year dataset. Gamma-ray 
data are potentially less subject to astrophysical 
effects, such as changes in the radio pulse shape 
(21). This stability is particularly useful for
probing GWs with frequencies below 0.1 year⁻¹.
Such low frequencies are predicted to constrain 
the spectral shape of the GWB, which contains 
information about the physical sources (5). 

There are other potential sources of power-law
GWBs with different spectral indices, α,
such as α = −1 for relic GWs originating during
scale-invariant inflation in the early Universe
(30). Decay of (hypothetical) cosmic strings
could also produce power-law spectra under
a variety of scenarios (31). To constrain such
sources, we computed corresponding 95%
upper limits on A_gwb at different values of α
(Fig. 3). Other models are not well described 
by power laws, but their largest predicted sig- 
nals are in or near the PTA band (32, 33). 

We have used the Fermi-LAT dataset to con- 
struct a gamma-ray PTA. This provides an 
independent method to search for signals 
detected with radio PTAs. Unlike the radio 
PTAs, this method is free from the effects 
of the IISM. Most of the pulsars are amenable 
to the TOA-based approach, and the resulting 
datasets are small compared with those of 
radio PTAs, enabling analysis alongside radio 
PTA data with little additional computational 
burden. 
EXTINCTION

Avoiding ocean mass extinction from climate warming

Justin L. Penn¹,²* and Curtis Deutsch¹,²*


Global warming threatens marine biota with losses of unknown severity. Here, we quantify global and 
local extinction risks in the ocean across a range of climate futures on the basis of the ecophysiological 
limits of diverse animal species and calibration against the fossil record. With accelerating greenhouse
gas emissions, species losses from warming and oxygen depletion alone become comparable to current
direct human impacts within a century and culminate in a mass extinction rivaling those in Earth’s past. Polar 
species are at highest risk of extinction, but local biological richness declines more in the tropics. Reversing 
greenhouse gas emissions trends would diminish extinction risks by more than 70%, preserving marine 
biodiversity accumulated over the past ~50 million years of evolutionary history. 


Human activities are altering the global
climate, physically transforming habitats,
and overexploiting ecosystems of
land and sea (1, 2). As a result, rates
of species extinction have risen above
natural background levels (3, 4). Documented 
extinctions are largely confined to land, where 
industrial human impacts began earlier and 
remain more pervasive despite rapid growth 
in commercial fishing, marine pollution, and 
transport (5). Climate change may eventually 
eclipse direct local human threats by causing 
widespread habitat loss through changes in 
the thermal and chemical conditions that reach 
even the most remote biomes on Earth, in- 
cluding the deep and open ocean (6, 7). However, 
climate’s impact on global biodiversity is chal- 
lenging to observe, especially for undersampled 
marine environments (8), and common statis- 
tical models are difficult to validate, particu- 
larly as new climate conditions emerge (9-11). 
The potential for substantial biodiversity loss
is illustrated by the fossil record, where long-term
diversification (12, 13) is punctuated by
episodic extinctions of varying intensity
(Fig. 1A and fig. S1A). The most extreme
events, the “Big 5” mass extinctions, coin- 
cided with global environmental changes, 
although mechanisms driving biodiversity col- 
lapse remain uncertain (3). In the largest such 
event, the end-Permian “Great Dying,” loss of 
more than two-thirds of marine animal genera 
reduced biodiversity to near its minimum 
since animals first radiated (12-14). Similar
environmental changes that occurred in the
end-Permian, including rising temperatures
and declining ocean O2, productivity, and pH,
are now also underway in the Anthropocene 
(fig. S2; 6, 7, 14-17). Past mass extinctions provide
the only empirical benchmarks for evaluating
the severity and drivers of a “sixth
mass extinction” (3, 18).

¹School of Oceanography, University of Washington, Seattle, WA 98195, USA. ²Department of Geosciences, Princeton University, Princeton, NJ 08544, USA.
*Corresponding author. Email: jpenn@princeton.edu (J.L.P.); cdeutsch@princeton.edu (C.D.)

524 29 APRIL 2022 • VOL 376 ISSUE 6592

Here, we project global and local extinction 
risks for marine animals (as a percentage of 
species lost) on the basis of habitat loss from 
climate change using an ecophysiological 
model that predicted the severity and latitude 
pattern of the end-Permian extinction (17, 19). 
Multicentury climate and ocean conditions 
are simulated by a group of Earth system 
models that reproduce historical global warm- 
ing trends (table S1; 6, 20) and project future 
climate changes under scenarios of high and 
low anthropogenic greenhouse gas emissions 
(Fig. 1C, inset; 6, 7, 21). Global marine bio- 
diversity is represented through a set of >10* 
simulated animal species types (“ecophysio- 
types”; 17), defined by thermal and hypoxia 
tolerance traits (table S2 and fig. S3) that 
structure the current biogeography of diverse 
taxa (22-24). Model species are assigned traits 
with frequencies found in a global species data 
compilation (22), allowing predicted patterns 
and climatic changes in biological richness to 
be compared with and tested against richness 
observations. 

Ocean habitability requires conditions to 
remain within a species’ ecophysiological 
tolerances. Local O2 must meet temperature-dependent
metabolic demands for growth and
ecological activity, imposing covarying upper
temperature and lower O2 limits (22, 23). A
loss of aerobic habitat causes local species
extirpation wherever ocean warming and O2
loss drive the O2 supply-to-demand ratio
(Φ) below a species’ critical threshold (Φ_crit)
(17, 24, 25). Aerobic habitat losses of model
species are comparable to those within the
known geographic ranges of real species with
corresponding traits (fig. S4). New habitat can
also be gained at the cold edge of a species’
range if temperatures there rise above its minimum
tolerance (fig. S5; 25, 26).

Species are committed to global extinction 
if net habitat loss exceeds a critical fraction 
(V_crit) beyond which a viable population cannot
be sustained even if disappearance is
gradual (i.e., the extinction debt) (27). This uncertain
extinction threshold is varied around a
central value that is calibrated by the end-Permian
extinction (fig. S6; 17, 19). We also
considered the widest possible range of species
capacities to colonize new habitat, denoted
0 and 100% colonization (Fig. 1B; 19), using
the median of these scenarios as our central 
extinction case. 
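
The habitat rules described above can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the model used in the study: grid cells are treated as equal in volume, a cell counts as habitable when Φ is at least Φ_crit and the temperature is at least a species' minimum tolerance, and a colonization fraction between 0 and 1 scales how much newly habitable volume offsets losses. All numerical values, including the V_crit of 0.5, are hypothetical.

```python
def is_habitable(phi, temp_c, phi_crit, t_min_c):
    """A cell is habitable if O2 supply meets demand and it is warm enough."""
    return phi >= phi_crit and temp_c >= t_min_c


def net_habitat_loss(before, after, phi_crit, t_min_c, colonization):
    """Net fraction of initial habitat lost between two climate states.

    before, after: lists of (phi, temp_c) tuples, one per equal-volume cell.
    colonization: fraction (0 to 1) of newly habitable cells that is occupied.
    """
    was = [is_habitable(p, t, phi_crit, t_min_c) for p, t in before]
    now = [is_habitable(p, t, phi_crit, t_min_c) for p, t in after]
    initial = sum(was)
    if initial == 0:
        raise ValueError("species has no initial habitat")
    lost = sum(1 for w, n in zip(was, now) if w and not n)
    gained = sum(1 for w, n in zip(was, now) if n and not w)
    return (lost - colonization * gained) / initial


def committed_to_extinction(loss_fraction, v_crit):
    """Extinction is committed once net habitat loss exceeds V_crit."""
    return loss_fraction > v_crit


# Hypothetical four-cell ocean, ordered from warm edge to cold edge.
before = [(2.1, 25.0), (3.0, 18.0), (3.5, 10.0), (4.0, 3.0)]
after = [(1.6, 28.0), (2.4, 21.0), (3.0, 13.0), (3.6, 6.0)]

loss_0 = net_habitat_loss(before, after, 2.0, 5.0, colonization=0.0)
loss_100 = net_habitat_loss(before, after, 2.0, 5.0, colonization=1.0)
print(f"loss with 0% colonization:   {loss_0:.2f}")
print(f"loss with 100% colonization: {loss_100:.2f}")
print("committed (V_crit = 0.5):", committed_to_extinction(loss_0, 0.5))
```

In this toy case the warm-edge cell becomes hypoxic while the cold-edge cell becomes warm enough to occupy, so the outcome depends entirely on the colonization assumption, which is why the two bounding scenarios are both considered.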

As anthropogenic climate change acceler- 
ates, so too do projected species losses (fig. S7). 
The rate of these losses over time depends 
on the emissions scenario, which sets the pace 
of warming and ocean O2 loss. By the latter
half of this century, the divergent greenhouse 
gas emission scenarios will lead to markedly 
different climate trajectories and to growing 
disparities in the fraction of species lost globally 
(extinction) and locally (extirpation). 

The eventual intensity of species losses for 
all emissions scenarios, climate models, and 
time periods is well predicted by the mag- 
nitude of global surface warming (Fig. 1, B 
and C). Total warming in turn is governed by 
cumulative emissions and varies across models 
with different climate sensitivities. For a given 
temperature change, extinction risk depends 
on the amount of O2 lost (R² = 0.69 to 0.87,
range across colonization scenarios), which 
also differs among models (fig. S8). 

Under the low-emissions scenario, global 
temperature stops rising after ~1.9° + 0.5°C of 
warming [intermodel mean, SD; (6)] by the 
end of the century, and species losses remain 
close to current commitments (Fig. 1B, inset, 
and C). Under the high-emissions scenario, 
surface air warming could reach ~4.9° + 1.4°C 
by 2100 CE and ~10° to 18°C over the next three 
centuries (6), markedly elevating losses (Fig. 1, 
B and C). Significant losses are also expected 
in another high-emissions scenario (SSP3-7.0), 
which yields 8.2°C of warming by 2300 CE 
(5.7° to 11.8°C, 5 to 95% range) in a reduced-complexity
model that does not simulate O2
(6). Global extinction risk is higher if species 
cannot gain new habitat (Fig. 1B). Colonization 
of new regions at species’ cold-edge range 
boundaries partially compensates for aerobic 
habitat loss (25) but eventually ceases to main- 
tain habitat as warming intensifies. 

We compared extinction risks from future 
climate change with those from current direct 
anthropogenic threats (fig. S9; 19) on the basis 
of vulnerability assessments from the International
Union for Conservation of Nature (IUCN;
28). Future species losses from warming would
become comparable to the sum of all anthro- 
pogenic stressors at the end of this century 
(Fig. 1B, inset, and C). 

Under the high-emissions scenario, global 
extinction risks from continued warming even- 
tually rival the severity of past mass extinctions 
in the fossil record and in paleosimulations 
Fig. 1. Marine extinctions in the past and risk from climate warming.
(A) Extinction intensity (percent losses) from the fossil record of marine animal
genera over the past ~542 million years (12), including the “Big 5” mass extinctions
(35) (fig. S1A). (B and C) Projected global extinction (B) and global mean extirpation
risks (averaged over 1° latitude × longitude, 0 to 500 m) (C) rise with increases in
annual mean global surface air temperature (thick lines are intermodel averages; thin
lines are individual Earth system models) and are plotted under historical greenhouse
gas emissions (petagrams of carbon per year) and divergent future scenarios [(C),
inset], yielding radiative forcings of 2.6 W/m² [i.e., RCP/SSP1-2.6; (6, 19, 21)] versus
8.5 W/m² in 2100 [i.e., RCP/SSP5-8.5; (6, 19, 21)]. Inset in (B) zooms in to show end
of the century changes. Percent colonization refers to the fraction of new habitat that a
species can instantly disperse to and inhabit (thin dashed line is the median case).
Extinction risks from current anthropogenic threats are estimated from IUCN
vulnerability assessments (orange bars; tables S3 and S4; 19). Simulated end-Permian
extinction risks are shown across colonization scenarios in (B) and in (C) display
the global mean and spatial SD of extirpation. Points and horizontal lines on the top
axis show the average and range of warming across Earth system models evaluated
here, respectively, for different time horizons and emissions scenarios.
Fig. 2. Spatial variation in species losses and marine biological richness.
(A) By 2100, regions of strong extirpations overlap past productive fisheries 
(blue points), where catch rates exceed the global median from 1950 to 2014 
(36). (B to D) Patterns of extirpation risk (B), marine biological richness (C), and 
global extinction risk [(D); averaged across colonization scenarios] are shown 
versus latitude. Observed biological richness (number of species) estimated 


(Fig. 1B). A mass extinction in this emissions 
scenario is projected across all potential 
values of the extinction threshold except in 
the unlikely case that the average species can 
maintain a viable population in <10% of 
their initial habitat volume (fig. S10; 19). By 
contrast, limiting warming to 2°C would cut 
the severity of extinctions by >70%, avoiding a 
marine mass extinction across all extinction
thresholds. 

Climate-driven species losses vary widely 
among ocean biomes, with important implications 
for fisheries and the patterns of biological 
richness (Fig. 2). These spatial patterns also 
provide crucial observational tests of the un- 
derlying model. 


[Fig. 2 panel labels: Marine Biological Richness (observations, model); Extinction (%); 2300 high emissions]

using rarefaction (31) is reproduced by the trait-based habitat model 
applied to climatological distributions of temperature and O2 (37, 38) across a 
range of maximum summation depths [(C) line is 500 m; shading is 0
and 5000 m; (19)]. Richness is normalized to the maximum observation.
Extirpation and global extinction risks are averaged from 0 to 500 m, across 
Earth system models, and across longitude in (B) and (D). 


Extirpation risk is greatest where climate 
anomalies are strong (fig. S2) or species are 
living close to their ecophysiological thresholds, 
typically equatorward near the warm and/or 
low-O2 edges of their ranges (Fig. 2, A and B).
Vulnerable regions include highly productive
ecosystems where background O2 is already low,
such as the north Pacific, eastern boundary upwelling systems, and the tropical Indo-Pacific (Fig. 2A). These regions are also home to many of the world’s most productive fisheries, which supply ~17% of humanity’s dietary protein (29). The spatial pattern of extirpation is relatively stable over time, but its magnitude steadily rises with warming (Fig. 2B). Low extirpation intensities are predicted to be evident at low and northern midlatitudes already (Fig. 2B) and may underlie the previously documented range shifts observed there (24-26, 30). Earth system models may underrepresent both small-scale spatial refugia and short-term heat waves and related extreme events. Although these phenomena will undoubtedly modulate local impacts on some species, they are unlikely to override the broader biotic outcome from the persistent large-scale and long-term climate trends presented here.

Fig. 3. Past and potential futures of marine biological richness. The number of marine animal genera is plotted over time from the fossil record relative to present and projected into the future on the basis of model extinction risks, averaged (lines) and varying (SD, shadings) across Earth system models and colonization scenarios. The right axis shows the change in global mean temperature for a given richness loss. Vertical dashed lines denote the “Big 5” mass extinctions. The lower fossil curve is based on Sepkoski’s Compendium of Fossil Marine Animal Genera (12), and the upper curve presents these data with the secular trend found in the Paleobiology Database (https://paleobiodb.org) (fig. S12; 13). Letters indicate major stratigraphic intervals plus the current Anthropocene. Time scale differs for past and future (see fig. S1B for both plotted on the same scale).

The latitudes of strong fractional species extirpation also overlap regions of peak biological richness (Fig. 2, B and C). The number of observed marine animal species increases from the poles toward the tropics, with a reduction near the equator (31-33). The model reproduces this pattern: As temperatures rise above species’ minimum tolerances, richness increases from the poles to the tropics but decreases approaching the equator, where species reach their temperature-dependent hypoxia limits.

In contrast to local extinctions, global extinction risk is greater for polar species than for tropical ones, threatening higher-latitude richness where extirpation is weaker (Fig. 2, C and D). Species initially inhabiting the tropics can tolerate warm, low-O2 waters, making them resilient to the climatic expansion of those conditions, especially for species with high colonization ability (fig. S11). By contrast, polar species occupy a disappearing climate niche and lack habitat refugia as the climate warms. This latitudinal extinction pattern has been detected in the fossil record of the end-Permian extinction (17, 34), supporting the mechanistic basis for model projections.

The projected impact of accelerating climate 
change on marine biota is profound, driving 
extinction risk higher and marine biological 
richness lower than has been seen in Earth’s 
history for the past tens of millions of years 
(Fig. 3 and fig. S1). Additional climate-related 
threats beyond warming and O2 loss, including
ocean acidification and declining primary
productivity, have the potential to amplify 
these losses even further (6, 7, 18). However, 
it is not too late to enact the reductions in 
greenhouse gas emissions needed to avoid a 
major extinction event. The low-emissions 
scenario assumes that declines began around 
2020 CE and continue thereafter (Fig. 1C, inset). 
Coordinated efforts to slow the local impacts 
of overfishing and marine pollution demon- 
strate the high intrinsic and economic value of 
preserving marine biodiversity and living re- 
sources. Realizing the fruits of these conservation 
efforts depends on mounting complementary 
societal responses to avert the greater threat 
posed by climate change. 
Discovery chemists routinely identify purpose-tailored molecules through an iterative structural
optimization approach, but the preparation of each successive candidate in a compound series can 
rarely be conducted in a manner matching their thought process. This is because many of the necessary 
chemical transformations required to modify compound cores in a straightforward fashion are not 
applicable in complex contexts. We report a method that addresses one facet of this problem by allowing 
chemists to hop directly between chemically distinct heteroaromatic scaffolds. Specifically, we show that 
selective photolysis of quinoline N-oxides with 390-nanometer light followed by acid-promoted 
rearrangement affords N-acylindoles while showing broad compatibility with medicinally relevant 
functionality. Applications to late-stage skeletal modification of compounds of pharmaceutical interest 
and more complex transformations involving serial single-atom changes are demonstrated. 


The maturation of chemical synthesis has
given rise to an era in which molecules
can be exhaustively optimized to serve
specific purposes under exceptional multidimensional
constraints, enabling increasingly
precise applications of these compounds.
The intensity of this enterprise manifests most 
visibly in medicinal chemistry, where the simul- 
taneous management of efficacy, specificity, 
absorption, and lifetime is accomplished through 
meticulous tailoring of promising candidate 
molecules (1, 2). While these molecular optimizations
establish structure-activity relationships
by iterative modification of a series of
parent compounds, the synthetic practice un- 
derlying these campaigns is rarely in line with 
its philosophical roots. Rather than convert- 
ing a lead compound to the next candidate in 
a manner matching their underlying thought 
process, these campaigns instead largely rely 
on iterative resynthesis, because the reactions 
necessary to perform the envisioned direct con- 
version are often not applicable in complex 
settings (Fig. 1A) (3, 4). This shortcoming is 
particularly conspicuous when conducting a 
“scaffold hop,” a common strategy that leverages
computational estimates of three-dimensional 
molecular similarity (or in silico binding affinity 
to the target) to predict isofunctional structures 
with distinct cores (5, 6). The logic of this strat- 
egy can be immediately appreciated by compar- 
ing members of a given class of pharmaceuticals, 
for example, the cholesterol-lowering thera- 
peutics pitavastatin and fluvastatin or the anti- 
inflammatory drugs etoricoxib and celecoxib; 
this same logic is also clear when comparing 
compounds in a given development series, 


¹Department of Chemistry, University of Chicago, Chicago, IL, USA. ²Discovery Chemistry, Merck & Co., Inc., Boston, MA, USA. ³Analytical Research and Development, Merck & Co., Inc., Boston, MA, USA.
*Corresponding author. Email: marklevin@uchicago.edu




such as the dideazafolic acid antecedent to 
the chemotherapy agent pemetrexed (Fig. 1B) 
(7-9). Unfortunately, the execution of a pre- 
dicted scaffold hop is among the most difficult 
of possible lead optimization strategies to per- 
form directly. Unlike diversification strategies 
relying on robust, late-stage coupling reactions 
that can target some peripheral substruc- 
tures of a lead molecule for rapid interroga- 
tion, molecular cores are far more challenging 
to examine in a similar fashion because of the 
often-distinct preparative methods for the rel- 
evant (hetero)cyclic frameworks. As such, chem- 
ists interested in examining a scaffold hop 
are typically required to resort instead to an 
effective reset of their synthetic campaign, 
beginning from scratch to traverse laterally 
in chemical space. 

Accordingly, a pressing challenge and in- 
creasing recent area of focus for modern organic 
synthesis is the development of transforma- 
tions that can address the molecular skele- 
ton with precision and enable direct scaffold 
hops between distinct core substructures within 
a given class of compounds. Ideally, such trans- 
formations would enable control at the level 
of single-atom precision (Fig. 1C), with more 
sophisticated changes possible through itera- 
tive elementary skeletal modifications (10). No- 
table recent contributions from several groups 
have been reported in the context of saturated 
aliphatic heterocycles (11-13). Although the
centrality of aromatic and heteroaromatic scaf- 
folds in medicinal chemistry suggests a clear 
priority for similar azaarene interconversion 
strategies, the stability of aromatic systems 
poses a substantial challenge: The reactive 
species typically required to breach the core 
are often not compatible with densely func- 
tionalized druglike compounds and rarely 
promote precise, selective downstream chemistry
(14-16). We report here a transformation
that confronts this challenge, enabling a broadly
applicable ring contraction of quinoline
N-oxides and related azaarenes. Subsequent 
deacylation of the product N-acylindole allows 
this transformation to serve as a net carbon 
deletion (Fig. 1D). 

This advance is built on the classical photo- 
chemistry of quinoline N-oxides (1), whose di- 
verse rearrangement products were meticulously 
cataloged by Buchardt, Streith, Kaneko, and 
Albini (Fig. 2A) (17-19). Although N-acylindoles
(2) and related hydration products have been 
observed arising from a limited set of substrates, 
more complex rearrangement products often 
predominate, including quinolones, 2- and 
3-acylindoles (bearing noncleavable acyl groups), 
and 3-hydroxyquinolines. Product mixtures of 
these compounds are typically observed, and 
in many cases, seemingly minute perturbations 
to the substrate structure result in drastic 
changes to the product distribution (see figs. 
S20 to S28 and the associated discussion for
a brief summary). Beyond this, the classical 
mercury (Hg) lamp irradiation conditions 
are incompatible with many complex quino- 
lines of relevance to medicinal chemistry (an 
observation we have reproduced—see below). 
On the basis of prior mechanistic work on these 
and related photochemical transformations, 
we suspected that the undesired products were 
the result of secondary photoprocesses of the 
intermediate, formally antiaromatic 2,1- and/or 
3,1-benzoxazepines (3) (20, 21). We hypothesized
that these two-photon by-products could
be avoided using a milder, narrow-spectrum 
light source. 

Indeed, we have found that the use of 390-nm 
light-emitting diodes (LEDs) in place of tradi- 
tional mercury lamps substantially improves this 
classical photoreaction, turning an academic 
curiosity into a potential workhorse transforma- 
tion with broad utility (22-27). This selective 
irradiation of quinoline N-oxides produces high 
yields of the corresponding 3,1-benzoxazepine 
in the photolysate, with subsequent in situ treat- 
ment with an acid catalyst promoting isomer- 
ization to the N-acylindoles in similarly high 
yield. As detailed below, this enables challeng- 
ing indole and azaindole syntheses; facilitates 
late-stage, direct scaffold hopping of medic- 
inal compounds; and serves as a productive 
springboard for further skeletal modification 
strategies. 

We began our investigation with 2-methyl- 
quinoline N-oxide (1a) (Fig. 2B). Irradiation with 
a 390-nm LED in toluene at ambient tempera- 
ture for 5 hours resulted in complete consump- 
tion of the quinoline N-oxide, affording the 
corresponding benzoxazepine 3a in 91% nu- 
clear magnetic resonance (NMR) yield, along 
with a minor quantity of the deoxygenation 
product 4a (15:1 selectivity). Although 3a was 
not isolable without substantial decomposi- 
tion, its conversion could be monitored by LED- 
NMR, allowing measurement of a quantum 
Fig. 1. Introduction to scaffold hopping and single-atom skeletal editing. (A) Schematic representation of the disconnect between design and synthesis in
molecular optimization. (B) Selected examples of scaffold hopping in medicinal chemistry. HMG-CoA, 3-hydroxy-3-methylglutaryl coenzyme A; Me, methyl group;
COX-2, cyclooxygenase-2. (C) Single-atom skeletal editing for heterocycle interconversion. (D) Carbon deletion of azaarenes delineated in this work.


yield, Φ, of 0.096 for conversion of the quinoline
N-oxide (28). In contrast to 3a, cyano-
substituted benzoxazepine 3b was found to be 
readily isolable and could similarly be prepared 
by irradiation with a 390-nm LED in 83% 
isolated yield. The increased stability of 3b 
offered an opportunity to probe our light source 
hypothesis. As predicted, 3b was found to be 
substantially more stable upon further irra- 
diation by the LED than under mercury lamp 
irradiation (Fig. 2C). The benzoxazepine was 
returned in near-quantitative recovery after 
6 hours in the former case, whereas nearly 
half of the material degraded to a mixture of 
products in the same period under the latter 
conditions. Examination of the absorption 
spectra of 1b and 3b reveals the origin of this
drastic light source effect (Fig. 2D). Whereas
the quinoline N-oxide has a relative absorption
maximum, λmax, at 386 nm, the benzoxazepine
shows a substantial hypsochromic shift to a λmax
of 323 nm, such that the LED accomplishes 
selective irradiation of the starting material 
while the mercury lamp promotes photodegradation
of the benzoxazepine.
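
The light source effect described above can be illustrated with a rough numerical sketch. The following Python example is not part of the reported work: it assumes Gaussian band shapes, takes the band centers from the text (390-nm LED; absorption maxima at 386 and 323 nm), and uses assumed, purely illustrative bandwidths to compare how strongly the LED emission overlaps each absorber.

```python
import math


def gaussian(x, center, width):
    """Unit-height Gaussian band; `width` is the standard deviation in nm."""
    return math.exp(-0.5 * ((x - center) / width) ** 2)


def overlap(em_center, em_width, abs_center, abs_width,
            lo=250.0, hi=500.0, step=0.5):
    """Riemann-sum overlap integral of an emission band with an absorption band."""
    n = int((hi - lo) / step)
    total = 0.0
    for i in range(n + 1):
        x = lo + i * step
        total += gaussian(x, em_center, em_width) * gaussian(x, abs_center, abs_width)
    return total * step


# Band centers are taken from the text; all widths are ASSUMED for illustration.
LED_CENTER, LED_WIDTH = 390.0, 8.0   # narrow-band LED (width assumed)
N_OXIDE_MAX = 386.0                  # absorption maximum of the N-oxide
BENZOXAZEPINE_MAX = 323.0            # absorption maximum of the benzoxazepine
ABS_WIDTH = 20.0                     # assumed absorption bandwidth

led_on_n_oxide = overlap(LED_CENTER, LED_WIDTH, N_OXIDE_MAX, ABS_WIDTH)
led_on_benzoxazepine = overlap(LED_CENTER, LED_WIDTH, BENZOXAZEPINE_MAX, ABS_WIDTH)
selectivity = led_on_n_oxide / led_on_benzoxazepine

print(f"LED overlap with N-oxide band:       {led_on_n_oxide:.3f}")
print(f"LED overlap with benzoxazepine band: {led_on_benzoxazepine:.5f}")
print(f"Selectivity ratio:                   {selectivity:.1f}")
```

Under these assumptions the LED overlaps the N-oxide band far more strongly than the benzoxazepine band. The absolute numbers depend entirely on the assumed widths and say nothing about a broadband mercury lamp, which would need its own emission profile.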

Our interest in the photochemical behav- 
ior of quinoline N-oxides stemmed not from 
the benzoxazepine intermediates themselves 
but rather from the indole products presumed 
to arise from them. Classical studies had sug- 
gested that these indole products were the re- 
sult of adventitious (or added) water generating 
acid, which in turn acted on the 3,1-benzoxazepine 
(29). This prompted us to examine the effect 
of exogenous acid additives. The crude photo- 
lysate consisting predominantly of methyl- 
substituted benzoxazepine 3a reacted smoothly 
under the action of trifluoroacetic acid to af- 
ford acylindole 2a in 78% yield relative to 1a. 
To determine the mechanism by which the 
protonated benzoxazepine evolves to product, 
we conducted an ¹⁸O labeling study, which
showed substantial but incomplete mainte- 
nance of the isotopic label in the hydrolysis 
process. This result is most consistent with 
two concurrent pathways for benzoxazepine
hydrolysis, although the potential for ¹⁸O-
water liberated in the N-protonation pathway 
to react further precludes a quantitative anal- 
ysis of the partitioning between these path- 
ways (Fig. 2E) (30). 

Although direct photochemistry can often 
vary as a function of substrate structure, the 
efficacy and advantage of the 390-nm LED 
was found to be surprisingly general, both with 
respect to the substituent on the excised carbon 
and the residual indole substituents (Fig. 3). A 
wide range of quinoline N-oxides with varying 
substitution patterns were found to undergo 
facile photorearrangement to afford the cor- 
responding benzoxazepine, and the subsequent 
acid-promoted rearrangement was likewise 
found to be generally applicable. For most sub- 
strates, trifluoroacetic acid was effective for 
this latter operation, although in cases bearing
3-substituents (1m, 1ac, 1x, and 1ag) or
electron-withdrawing groups (1b and 1s) on
the quinoline, the more acidic para-toluene- 
sulfonic acid afforded higher yields. This protocol 
Fig. 2. Mechanistic basis for light source effects in quinoline N-oxide photochemistry and mechanism of acid-promoted benzoxazepine rearrangement.
(A) [...] broadband mercury lamp irradiation. (B) LED-NMR study of 390-nm LED photolysis of 1a and isolation of 2-cyano-3,1-benzoxazepine (3b) from
LED photolysate of quinoline N-oxide 1b. ppm, parts per million. (C) Photostability study of 3b under LED and Hg lamp irradiation. (D) Ultraviolet-visible
absorption spectra of 1b and 3b with overlaid [...] lamp (normalized to an emission intensity of 1 for λmax). (E) Labeling study of trifluoroacetic acid
(TFA)-promoted rearrangement of 3a.


was also found to be amenable to a one-pot
process, with initial formation of the N-oxide
induced by hydrogen peroxide in dichloromethane
followed by dilution of the crude
reaction mixture with toluene (95:5) before
irradiation affording the N-acylindole in only
slightly diminished yields (69%) relative to the
two-pot procedure with purified N-oxide (78%).

As noted above, this transformation was found
to be robust to a wide variety of functionality
commonly encountered in medicinal chemistry,
including other heterocycles (2d, 2e, 2f,
Fig. 3. Scope of the photochemical conversion of azaarene N-oxides into
acylazoles. Conditions: 1 (0.3 mmol), toluene (0.06 M), 390-nm LED, 0.5
to 5 hours at 25°C, then TFA (0.3 mmol) for 2 to 5 hours at 25°C. Isolated yields
are given. The asterisk symbol indicates where acidolysis was conducted with
TsOH·H2O instead of TFA. The single-dagger symbol indicates where acidolysis
was conducted at 60°C. The double-dagger symbol indicates where, instead of
acidolysis, photolysate was concentrated and refluxed in H2O/MeOH (1:1).
Hg lamp yields measured by proton nuclear magnetic resonance versus
an internal standard. Ph, phenyl group; OMe, methoxy group; MTO, methyltrioxorhenium;
BMIDA, N-methyliminodiacetyl boronate; Boc, butoxycarbonyl;
nPr, n-propyl; TsOH, para-toluenesulfonic acid; (BzO)2, benzoyl peroxide;
tBu, tert-butyl.
2g, and 2i), polyhalogenation (2v, 2w, and
2x), carbamates (2y, 2z, 2aa, and 2ab), phosphonates
(2o), sulfones (2s), and boronic esters
(2q). Notably, the scope demonstrated here 
was found to be a direct consequence of the 
milder light source. Of 12 quinolines exam- 
ined, five gave detectable indole products at 
substantially diminished yield (3 to 13%) when 
a mercury lamp was used for the initial photol- 
ysis, with the remaining seven giving completely 
intractable mixtures. Additionally, complete 
solubility in toluene was not a requirement, 
with dissolution over the course of irradiation 
observed for a number of substrates (e.g., 1s,
1q, 1y, 1af, and 1aj). In all cases, full conversion
of the starting material was achieved with
sufficient irradiation time. The most common 
by-product observed was deoxygenation of the 
N-oxide to afford the parent azaarene. Limi- 
tations of the photolysis were principally re- 
lated to the 2-substituent (hydrogen, tertiary 
alkyls, and heteroatom substitution were not 
tolerated; see supplementary materials for 
further details). We also note that oxidatively 
sensitive functionality is not maintained in the 
initial N-oxidation (e.g., sulfides are converted 
to sulfones). 
An interesting consequence of the switch
between electron-poor quinoline and electron- 
rich indole heterocycles is the ability to inter- 
face their distinct reactivities and syntheses 
through carbon deletion. This is exemplified 
in the first instance by the preparation of indole 
2af through Minisci alkylation of the parent 
quinoline at the 4-position, resulting in the net 
3-alkylation of the final, nucleophilic indole 
product with a nucleophilic radical, a challenging
retrosynthetic strategy to realize via
known methods (31-33). Quinoline 4ag dem- 
onstrates the latter interplay of the two hetero- 
cyclic scaffolds, allowing the Pfitzinger quinoline 
synthesis to serve additionally as an indole syn- 
thesis (34). The product 2ag is related to the 
anti-inflammatory medicine indomethacin (35). 

Higher polyazaarenes were also found to be 
productive substrates, enabling the preparation 
of 7-azaindole, pyrrolopyrazole, pyrroloisoxazole, 
and benzimidazole scaffolds through net carbon 
deletion of the parent fused-ring azine. The 5,5- 
fused systems are highly challenging to pre- 
pare by traditional heterocycle syntheses and 
thus showcase a distinctive advantage of our 
approach. To further highlight the utility of
this method, we demonstrated its capacity to
modify complex medicinal compounds. Starting
from montelukast (Singulair), a widely
prescribed leukotriene inhibitor, the pendant 
chloroquinoline could be transformed into the 
corresponding acylindole 2al (36). Finally, the 
direct scaffold hop from pitavastatin to its 
indole congener 2am could be accomplished, 
creating a link in chemical space to fluvastatin 
via carbon deletion (37). 

As noted above, linear combinations of dis- 
tinct single-atom insertions and deletions offer 
exciting opportunities to devise more complex 
skeletal editing transformations. Figure 4 show- 
cases the ways in which carbon deletion can 
be leveraged as a foundation for such strate- 
gies using a simple model system. Starting with 
quinoline 4an, carbon deletion affords the 
indole 5an, with the photorearrangement scal- 
able up to 1 g in flow. Subsequent application 
of our previously reported C3-selective carbon 
insertion reaction gives the isomeric quinoline 
4ao, which has formally had its C2 and C3 
substituents swapped relative to the starting 
4an (38). This quinoline can again be sub- 
jected to carbon deletion to afford indole 5ao. 
Here, comparison to its predecessor 5an re- 
veals the effective replacement of the methyl 
substituent with a phenyl. Finally, if indole
5ao is subjected to nitrogen insertion through
a precedented N-amination and oxidative aromatization
sequence, cinnoline 7ao can be
accessed, now the formal C-to-N exchange
product of starting quinoline 4ao (39-41).

This work offers a broadly applicable, C2- 
selective, net carbon deletion of quinolines 
and related azaarenes through a ring contrac- 
tion of the corresponding N-oxides. Avoiding 
deleterious overreaction through selective 
photoexcitation renders classical N-oxide photo- 
chemistry applicable to medicinal chemistry 
applications. This work further showcases the 
potential for direct scaffold hopping enabled 
by the carbon deletion transform, especially 
when used in combination with the growing 
library of single-atom skeletal edits. 

Accelerating reaction generality and mechanistic
insight through additive mapping

Reaction generality is crucial in determining the overall impact and usefulness of synthetic methods.
Typical generalization protocols require a priori mechanistic understanding and suffer when applied to 
complex, less understood systems. We developed an additive mapping approach that rapidly expands 
the utility of synthetic methods while generating concurrent mechanistic insight. Validation of this 
approach on the metallaphotoredox decarboxylative arylation resulted in the discovery of a phthalimide 
ligand additive that overcomes many lingering limitations of this reaction and has important mechanistic
implications for nickel-catalyzed cross-couplings.


Over the past century, organic chemists
have invented a substantial number of 
catalytic bond-forming reactions (Fig. 1A). 
Many of these innovative transformations 
enable streamlined access to high-value 
molecular motifs. However, despite the vast 
and growing body of known chemical reac- 
tions, only a select few are routinely used by 
organic chemists and researchers in adjacent 
fields. Among these are olefin metathesis, the 
Suzuki coupling, and the Buchwald-Hartwig 
coupling (1-3). 
The few transformations that have made 
the leap from invention to mainstay reaction 
share a key feature: They are high yielding and 
robust and capable of readily accommodating
a wide range of substrate functionality and 
complexity. It is generally accepted that the 
elusive attributes of substrate generality and 
reaction efficiency cannot be anticipated, and 
it typically takes years of rigorous study to fully 
optimize the scope and yield of a challenging 
reaction (4-7). As outlined in Fig. 1, traditional
reaction generalization is an iterative process 
that begins with careful mechanistic investiga- 
tion. The insights gained in this exercise may 
suggest rational modifications that can lead to 
incrementally improved performance through 
elaborate catalyst optimization (8-14). Unfor- 
tunately, this approach becomes problematic 
when applied to inherently complex catalytic 
systems, in which mechanistic insights into 
underlying issues are not straightforwardly 
achieved or leveraged (hereafter referred to 
as complex reactions). As a result, the chemical 
literature is replete with promising but under- 
utilized reactions that have yet to realize their 
full potential because of their mechanistic
ambiguity. 

In response to this challenge, we sought to 
develop an approach amenable to the gener- 
alization of complex reactions that also yields 
mechanistic insight that is inaccessible other- 
wise. Inspiration was drawn from the medicinal 
chemistry practice of phenotypic screening 
(15-19). A phenotypic screen involves the ap- 
plication of libraries of chemically diverse com- 
pounds to entire biological systems of interest 
while searching for desirable changes in the 
observable traits (the phenotype). This ap- 
proach has the potential to simultaneously 
uncover and modulate hitherto unrecognized 
biomolecular mechanisms. It has proven in- 
valuable in developing powerful therapeu- 
tic approaches that leverage unappreciated 
mechanisms of action and improved indica- 
tions recalcitrant to traditional drug discovery 
techniques (20-23). We reasoned that com- 
plex reactions can be likened to complex 
biological systems, so there would be enormous 
benefit to applying this concept to the study 
and improvement of the former. 

Unexpected improvements in reaction
efficiency achieved with additives are a well-documented
but often overlooked phenomenon
within organic chemistry. Examples
include the lithium chloride effect in Stille 
couplings (24) and, more recently, work con- 
ducted by the Watson group (25), the Dong 
group (26), and our own groups (27, 28). Dis- 
covered additives can often be rationalized 
ex post facto but are nearly impossible to pre- 
dict a priori. More importantly, knowledge 
of additive effects can be leveraged to extract 
important information about the mechanism 
of the reaction itself. Rapid evaluation of an 
additive library might therefore result in sim- 
ilar unforeseen mechanistic modifications that 
benefit the reaction and lead to important 
mechanistic insights. 

Our envisioned strategy is outlined in Fig. 1 
(bottom). We selected a challenging, complex 
reaction of limited substrate scope and began 
evaluating additives in a systematic fashion. 
Using high-throughput experimentation (HTE) 
methods, we were able to identify privileged 
motifs by mapping the evaluated additive space 
onto the resulting yield. Identified hits were 
then evaluated through structure-activity rela- 
tionship (SAR) and mechanistic studies with 
the goal of identifying the optimal additive 
and gaining broader chemical insights. Ratio- 
nalization of the additive effect should gener- 
ate new, nonobvious mechanistic information 
useful to organic methodology at large. 
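
The mapping step can be sketched in a few lines of Python. This is a minimal illustration rather than the workflow used in this study: the function names, the motif labels, the twofold hit threshold, and all yields below are hypothetical.

```python
from collections import defaultdict


def map_additives(results, control_yield):
    """Group screening results by structural motif.

    `results` is an iterable of (additive_name, motif, yield_percent) tuples.
    Returns {motif: mean fold change in yield relative to the additive-free control}.
    """
    by_motif = defaultdict(list)
    for _name, motif, yield_percent in results:
        by_motif[motif].append(yield_percent / control_yield)
    return {motif: sum(folds) / len(folds) for motif, folds in by_motif.items()}


def privileged_motifs(fold_by_motif, min_fold=2.0):
    """Motifs whose mean fold change meets an ASSUMED hit threshold, best first."""
    hits = [(motif, fold) for motif, fold in fold_by_motif.items() if fold >= min_fold]
    return sorted(hits, key=lambda item: item[1], reverse=True)


# HYPOTHETICAL screening data, invented for illustration only.
screen = [
    ("additive_01", "cyclic imide", 42.0),
    ("additive_02", "cyclic imide", 38.0),
    ("additive_03", "aniline", 3.0),
    ("additive_04", "phenol", 5.0),
    ("additive_05", "hydantoin", 30.0),
]

folds = map_additives(screen, control_yield=10.0)
hits = privileged_motifs(folds)
for motif, fold in hits:
    print(f"{motif}: {fold:.1f}-fold")
```

Averaging by motif is only one possible aggregation. A real campaign would likely also track side-product formation and the spread of yields within each motif before advancing a hit to SAR studies.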

We sought to apply this approach to the 
construction of C(sp²)-C(sp³) bonds in an
effort to forward the long-sought-after goal 
to “escape from flatland” within contemporary 
drug discovery (29, 30). Several elegant meth- 
ods have been developed to address this gap, 
Fig. 1. High-throughput additive mapping to understand and improve organic methods. Shown are the
traditional approach and select successes, as well as the proposed HTE additive mapping approach. 


but none has the requisite starting material 
availability to enable broad access to chemical 
space (31-37). The metallaphotoredox decar- 
boxylative arylation reaction initially published 
by the Doyle and MacMillan groups (38) has the 
putative advantage of broad commercial avail- 
ability of both substrates (carboxylic acid and 
aryl halide) but has not gained widespread 
traction because of shortcomings in reaction 
generality. Specifically, the decarboxylative 
arylation is not amenable to (i) coordinating 
substrates, (ii) aryl bromides prone to proto- 
dehalogenation, (iii) challenging oxidative 
additions, or (iv) nonactivated carboxylic 
acids—namely, those that result in unstabilized 
radicals upon oxidative decarboxylation (39). 
Major efforts by our group and others (40) 
using traditional optimization approaches 
have failed to either substantially expand the 
scope of this reaction or to yield any helpful 
mechanistic leads. Therefore, our additive mapping
approach could be well equipped to address
this challenge.

To begin, we focused on an additive library 
of 721 diverse organic molecules amenable to 
HTE evaluation (Fig. 2B; also see the supple- 
mentary materials for further discussion). In 
principle, this approach should also be com- 
patible with other classes of additives (e.g., 
salts, ligands, and metals). Concomitantly, we 
also selected several challenging coupling part- 
ners, including substrates with coordinating 
basic nitrogens, nonactivated carboxylic acids, 
and aryl halides that lead to sizeable quan- 
tities of Minisci and protodehalogenation 
side products. These reactions were performed 
in a nanomole-scale photoredox setup in 
which performance, as determined by reaction 
yield, was evaluated in the presence of each 
additive. 

Unsurprisingly, most additives, particularly 
those containing groups such as heterocycles, 
anilines, and phenols, led to attenuated
reaction efficiency (41) (Fig. 2B). However, 
certain five-membered cyclic imides and 
hydantoins were found to give a strong boost 
in overall yield, sometimes up to fivefold, as 
well as a sharp decrease in protodehaloge- 
nation. This result was unexpected given that 
coordinating functionality is traditionally 
a powerful reaction poison caused by cata- 
lyst chelation. 


Fig. 2. Additive mapping applied to the decarboxylative reaction. (A) Metallaphotoredox decarboxylative arylation. (B) High-throughput additive screening on
challenging coupling partners revealing the imide effect. (C) Left: SAR studies to identify the ideal five-membered imide. A set of 64 imides was evaluated in 
48 couplings to reveal the best imide. Right: Phthalimide can counteract the poisoning effect of problematic functionalities. 
To gauge improvements in the functional
group tolerance, compound X2 and cyclohexanoic
acid were resubjected to the earlier
additive screen in the presence of phthalimide,
and overall reaction performance was substantially
better (Fig. 2C, right, and fig. S13). A
range of compounds that previously served as
reaction poisons (e.g., 1,3-dicarbonyls and
benzoic acids) were now well tolerated. Furthermore,
examining the data in aggregate
showed that the overall average yield nearly
doubled in the presence of phthalimide, and
the overall number of reaction poisons (defined
as decreasing the yield by >33%) fell
from 390 to 208.

With these exciting results in hand, we
sought to benchmark the reaction improvement
in a pharmaceutically relevant context
against the Aryl Halide Informer Library (42).
This library is prototypical of the type of com- 
plex drug-like compounds seen in medicinal 
chemistry and contains a range of (hetero)aryl 
halides deemed to be inherently challenging 
for metal-catalyzed cross-couplings. When eval- 
uating the library against a nonactivated acid, 
we found that phthalimide had an outstand- 
ing impact. A sizeable 11 of the 18 aryl halides 
showed major performance improvements, 
and a tripling of overall average yield was ob- 
served, from 7.7 to 29.4% (Fig. 3A and fig. S14). 
Failures included compounds with free car- 
boxylic acids (X7 and X9) and aryl chlorides 
(X16 to X18), substrates that lie outside the scope 
of this transformation. The results position the 
decarboxylative arylation among the most suc- 
cessful couplings evaluated against the library. 
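
The poison criterion and the aggregate figures quoted above lend themselves to a short worked example. In this sketch the function names are hypothetical, and the >33% decrease is assumed to be measured relative to the additive-free reference yield; the poison counts and average yields are those quoted in the text.

```python
def is_poison(yield_with_additive, reference_yield, max_drop=0.33):
    """True when the additive lowers the yield by more than `max_drop`.

    ASSUMPTION: the drop is fractional and relative to the reference yield.
    """
    return yield_with_additive < reference_yield * (1.0 - max_drop)


def count_poisons(yields, reference_yield):
    """Number of additives classified as reaction poisons."""
    return sum(is_poison(y, reference_yield) for y in yields)


# Aggregate figures quoted in the text.
poisons_without, poisons_with = 390, 208
avg_yield_without, avg_yield_with = 7.7, 29.4  # informer library, percent

poison_reduction = 1.0 - poisons_with / poisons_without
yield_fold = avg_yield_with / avg_yield_without

print(f"Fraction of poisons removed by phthalimide: {poison_reduction:.1%}")
print(f"Fold change in average informer yield:      {yield_fold:.2f}")
```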

Encouraged by the informer results, we then 
set our sights on evaluating the new scope of 
the reaction using the phthalimide additive. A 
highly diverse array of 384 small, medicinally 
relevant aryl bromides were evaluated against 
a complex, nonactivated isonipecotic acid de- 
rivative on a nanomole scale (see the supple- 
mentary materials for details). To gauge the 
synthetic utility of the phthalimide additive, 
we used charged aerosol detection (CAD) to 
determine the yield of each reaction (43). A 10% 
reaction yield was selected as the threshold for the
potential isolability of products by mass-directed 
microisolation that was based on prior work 
(44, 45). 

With phthalimide, the number of compounds 
above our typical threshold for isolation more 
than doubled, from 70 to 187 (Fig. 3B, right, and 
fig. S17). Substantial improvements were ob- 
served for a wide variety of bromides, including 
six-membered ring systems (aryl bromides, 
pyridines, and pyrimidines), five-membered 
heterocycles (pyrazoles, imidazoles, and thia- 
zoles), and [6,5]- and [6,6]-heterobicycles 
(indoles, aza-indoles, benzimidazoles, and 
quinolines). Furthermore, phthalimide im- 
proved the functional group compatibility of 
the reaction, allowing for the presence of 
polar moieties such as 1,2-diols, phenols, and 
aminopyridines. 

We next examined the scope of 384 chem- 
ically diverse, relevant carboxylic acids against 
the structurally complex informers X1, X2, and 
X13 on a nanomole scale (1152 total combina- 
tions). The phthalimide additive produced 
important improvements in a range of pri- 
mary carboxylic acids including alpha ether, 
alpha thioether, benzylic, and, most notably, 
nonactivated acids, the latter representing a 
crucial advancement in overall reaction gen- 
erality because they were previously only 
narrowly tolerated (Fig. 3B, left, and figs. 
S21 to S23). Further improvements were likewise
seen for a range of cyclic carboxylic acids,
including both activated and nonactivated 
four-, five-, six-, and seven-membered rings. 
A boost in yield was also seen for a range of
acyclic carboxylic acids, and 19 of 20 protected 
amino acids were cross-coupled successfully 
in the presence of phthalimide, including both 
potential sites in aspartic acid and glutamic 
acid. Finally, the overall functional group com- 
patibility of the reaction appeared far more 
robust in the presence of phthalimide. Sub- 
strates bearing a range of functionalities, in- 
cluding phenols, aldehydes, aryl chlorides, and 
β-alcohols, performed notably better with the
additive. In aggregate, the number of reactions 
delivering products in >10% yields increased 
from 212 to 516, and the overall reaction CAD 
yield across the set increased more than two- 
fold. The improvements in both aryl bromide 
and acid scope observed in this study should, 
in a realistic setting, have a large impact on the 
generality of the decarboxylative arylation in 
synthesis at large. 
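
The scope figures reported for the two nanomole-scale campaigns can be restated as hit rates. The sketch below uses only the counts quoted in the text; the helper name `count_hits` is hypothetical, and treating the 10% threshold as a strict inequality is an assumption.

```python
ISOLATION_THRESHOLD = 10.0  # percent CAD yield, from the text


def count_hits(yields, threshold=ISOLATION_THRESHOLD):
    """Number of reactions whose yield exceeds the threshold (strict '>', assumed)."""
    return sum(1 for y in yields if y > threshold)


# Counts quoted in the text: reactions above threshold without/with phthalimide.
campaigns = {
    "aryl bromide scope": {"total": 384, "without": 70, "with": 187},
    "carboxylic acid scope": {"total": 1152, "without": 212, "with": 516},
}

summary = {}
for name, c in campaigns.items():
    rate_without = c["without"] / c["total"]
    rate_with = c["with"] / c["total"]
    fold = c["with"] / c["without"]
    summary[name] = (rate_without, rate_with, fold)
    print(f"{name}: {rate_without:.1%} -> {rate_with:.1%} ({fold:.2f}-fold more hits)")
```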

With these marked improvements in hand, 
we next sought to leverage the discovered 
effect of phthalimide to generate a funda- 
mental mechanistic understanding useful for 
future Ni-metallaphotoredox developments. 
We first focused on elucidating how phthal- 
imide turns nonactivated acids into competent 
coupling partners. Evaluating the reaction 
progress of a model system (figs. S31 and S32; 
substrate pair yields were 23% without phthal- 
imide and 81% with phthalimide), we observed 
full consumption of the aryl bromide and 
substantial protodehalogenation in the ab- 
sence of phthalimide. Conversely, suppression 
of protodehalogenation and predominant for- 
mation of the desired cross-coupled product 
was observed in the presence of phthalimide. 
We hypothesized that phthalimide prevents 
the decomposition of the intermediate oxi- 
dative addition complex (OAC), the most 
likely source of protodehalogenation. 

To investigate the possibility of a stabilizing 
interaction between the putative Ni-aryl com- 
plex and phthalimide in the reaction, we sought 
to emulate the formation of such a complex 
under reaction-relevant conditions. The Ni(II)
precatalyst was reduced by dropwise addition
of (Cp*)2Co in the presence of carboxylic acid,
2-tert-butyl-1,1,3,3-tetramethylguanidine 
(BTMG), and aryl bromide (Fig. 4B). We found 
that in the presence of potassium phthalimide, 
an OAC was formed that persisted for several 
hours [as determined by ¹⁹F-nuclear magnetic
resonance (¹⁹F-NMR) characterization], whereas
in absence of phthalimide, we obtained only 
protodehalogenation and reductive homocou- 
pling products. The identity of the observed 
OAC was subsequently confirmed through 
independent synthesis. 

Isolated complex 3 has substantial stability, 
which stands in stark contrast to the typically 
rapid decomposition of OACs lacking ortho 
substituents on the aryl ligand (46). Complex 3 
is stable for at least 24 hours in the dark in
both dimethyl sulfoxide (DMSO) and dichloromethane
(DCM), as well as under irradiation
for 2 hours in DMSO. Under reaction-relevant 
conditions, we found that decomposition was 
prevented when an excess of potassium phthal- 
imide was added to prevent ligand exchange by 
carboxylates. Phthalimide thus likely precludes 
typical decomposition pathways by keeping 
complex 3 coordinatively saturated. Despite this 
inherent stability, complex 3 is fully able to cap- 
ture radicals and undergo reductive elimination 
to form the desired cross-coupled product. Irradiation
of complex 3 in the presence of an equimolar
amount of (Ir[dF(CF3)ppy]2(dtbpy))PF6
and excess of carboxylic acid, BTMG, and 
potassium phthalimide yields the desired cross- 
coupled product in 67% yield (see the supple- 
mentary materials). 

We then set out to probe the importance of 
complex 3 to the reaction itself. Evaluation 
of the reaction progress of an activated acid 
(N-benzyloxycarbonyl-proline) and a nonacti- 
vated acid (cyclopentanoic acid) in the presence 
of phthalimide revealed that the nonactivated 
acid reacted more slowly overall (Fig. 4B, 
left). A Stern-Volmer analysis revealed a clear 
difference in the photocatalyst quenching 
rate between the two acids (Fig. 4B, middle). 
The slower quenching of nonactivated acids 
translates not only into slower formation of 
alkyl radicals but also slower formation of 
reduced iridium photocatalyst. By extension, 
we reasoned that this leads to slower Ni re- 
duction and slower formation of the OAC. The 
lower steady-state concentration of alkyl rad- 
icals and OACs translates into a slower rate of 
radical capture by Ni, which allows for it to be 
outcompeted by the unimolecular OAC de- 
composition pathway. Stabilizing the OAC 
should therefore benefit nonactivated acids 
because it precludes the decomposition path- 
way and extends the time frame for success- 
ful radical capture. A PhotoNMR experiment 
(using cyclopentanoic acid) in the absence of 
phthalimide revealed no ¹⁹F-NMR signals that
could be assigned to an OAC. When this experiment
was repeated in the presence of phthalimide,
a new ¹⁹F-NMR signal was observed that
matched the signal obtained from complex 3, 
indicating a considerable increase in the steady- 
state concentration of the OAC (Fig. 4B, right). 
This further supports our hypothesis. 
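
For readers less familiar with the analysis, a Stern-Volmer plot relates the ratio of unquenched to quenched emission intensity to quencher concentration, I0/I = 1 + K_SV[Q]. The sketch below fits K_SV by least squares with the intercept fixed at 1. The data are synthetic and generated from assumed K_SV values; they are not the measured quenching constants of the two acids.

```python
def stern_volmer_constant(quencher_conc, intensity_ratio):
    """Least-squares K_SV for I0/I = 1 + K_SV*[Q], with the intercept fixed at 1."""
    numerator = sum(q * (r - 1.0) for q, r in zip(quencher_conc, intensity_ratio))
    denominator = sum(q * q for q in quencher_conc)
    return numerator / denominator


# SYNTHETIC data generated from ASSUMED constants (not measured values).
conc = [0.00, 0.01, 0.02, 0.04, 0.08]              # quencher concentration, mol/L
fast_quencher = [1.0 + 50.0 * q for q in conc]     # assumed K_SV = 50 L/mol
slow_quencher = [1.0 + 5.0 * q for q in conc]      # assumed K_SV = 5 L/mol

k_fast = stern_volmer_constant(conc, fast_quencher)
k_slow = stern_volmer_constant(conc, slow_quencher)

print(f"K_SV (fast quencher): {k_fast:.1f} L/mol")
print(f"K_SV (slow quencher): {k_slow:.1f} L/mol")
```

A smaller K_SV, as found qualitatively for the nonactivated acid, corresponds to a shallower slope and hence slower generation of both alkyl radicals and reduced photocatalyst.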

Along a different line of inquiry, we realized 
that electron-rich aryl bromides seemed to give 
particularly strong improvements in yield, and 
we suspected a correlation between the elec- 
tronic properties of the aryl group and the 
effect of phthalimide. A Hammett study was 
conducted both in absence and in presence 
of phthalimide to probe this hypothesis (Fig. 
4C, left). A moderately strong Hammett ρ of
1.57 was observed in the absence of phthal- 
imide, which suggests that the oxidative ad- 
dition of the aryl bromide contributes to the 
Fig. 4. Mechanistic insight. (A) Decrease of protodehalogenation observed with phthalimide caused by stabilization of the OAC. For the x-ray structure (50% probability
ellipsoids), hydrogens and tBu/CF3 disorder are omitted for clarity. (B) Slower quenching is tolerated because of the increased OAC concentration. (C) Increasing active

overall rate of the reaction. We found that
the addition of phthalimide led to an overall 
increase in the initial reaction rate, in par- 
ticular for electron-rich aryl bromides, and 
a lowered ρ of 0.56. We reasoned here that
phthalimide must have some bearing on the 
oxidative addition. 
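
For readers less familiar with Hammett analysis, the reaction constant is the slope of log10(k_X/k_H) plotted against the substituent constant σ. The sketch below shows one way such a slope could be fitted by least squares. It is illustrative only and is not the analysis used in this study: the function name `hammett_rho` is hypothetical, the σ_para values are standard tabulated constants, and the relative rates are synthetic placeholders generated from an assumed slope rather than measured data.

```python
import math

def hammett_rho(sigmas, rel_rates):
    """Least-squares slope of log10(k_X / k_H) versus sigma."""
    ys = [math.log10(r) for r in rel_rates]
    n = len(sigmas)
    mean_x = sum(sigmas) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(sigmas, ys))
    sxx = sum((x - mean_x) ** 2 for x in sigmas)
    return sxy / sxx

# Standard tabulated sigma_para constants for common para substituents.
sigma_para = {"OMe": -0.27, "Me": -0.17, "H": 0.00, "Cl": 0.23, "CF3": 0.54}

# SYNTHETIC relative rates (k_X / k_H), generated from an assumed slope
# purely to exercise the fit; these are not data from the study.
assumed_rho = 1.57
rel_rates = [10 ** (assumed_rho * s) for s in sigma_para.values()]

rho = hammett_rho(list(sigma_para.values()), rel_rates)
print(f"fitted rho = {rho:.2f}")
```

A positive slope means that electron-poor aryl bromides react faster. A smaller slope, as observed with phthalimide, means that the rate depends less strongly on the electronics of the aryl group.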

To uncover the specific role of phthalimide 
in the oxidative addition, we evaluated the 
progress of a different model catalytic reaction 
in the presence and absence of phthalimide 
(Fig. 4C, top; substrate pair yields were 10% 
without phthalimide and 70% with phthal- 
imide). Unexpectedly, in the absence of phthal- 
imide, the reaction underwent deactivation 
over the first 100 min to reach an unproductive 
stationary state, whereas in the presence of 
phthalimide, the reaction steadily went to 
completion (figs. S24 and S25). In conjunction
with the Hammett data, we found it reason- 
able to postulate a deactivation of the reaction 
caused by a progressive decrease in catalytically 
active Ni able to undergo oxidative addition. 
Phthalimide would thus act by bringing the 
Ni back on cycle. 

To investigate the reactivation hypothesis, 
a competition study between phenyl bromide 
and various electronically distinct aryl bro- 
mides was conducted to assess product ratios 
at an early time point both in the absence and
presence of phthalimide (see the supple- 
mentary materials for details). For each aryl 
bromide, we found that product ratios did 
not change meaningfully when phthalimide 
was added (with the exception of the electron- 
poor 4-bromobenzotrifluoride). This is con- 
sistent with the hypothesis that phthalimide 
serves to increase the amount of Ni compe-
tent to undergo oxidative addition, which sup-
ports the proposed reactivation hypothesis.

It was reasonable to hypothesize that the 
deactivation could be attributed to the forma- 
tion of low-valent Ni oligomers, which are 
known to be unreactive toward oxidative ad-
dition in their oligomeric form (47, 48). Inde-
pendently prepared dimer [(dtbbpy)NiBr]₂ was
subjected to the reaction to test this hypothesis 
(Fig. 4C, middle). Trace yield of product was 
observed with the dimer for the previously 
used substrate pair, which confirms that this Ni 
species by itself is incompetent in this reaction. 
Repeating this experiment in the presence of 
phthalimide afforded the product in 31% yield, 
thus demonstrating that phthalimide can, at 
least in principle, return unreactive multimers 
into a monomeric, catalytically active state. 
Stable (bpy)Ni()-phthalimido complexes have 
been reported, which lends support to this 
hypothesis (49). 

Taking the previous observations together, 
we reasoned that the addition of phthalimide 
to a deactivated reaction should lead to a re- 
activation of the catalysts and resumption of 
product formation. Phthalimide should break 
up any formed inactive multimeric species and
thus allow for productive turnover. Consistent 
with our hypothesis, when phthalimide was 
doped into such a reaction, we observed almost 
complete reactivation and steady turnover 
(Fig. 4C, right). 

Overall, we believe that phthalimide acts in 
two mechanistically distinct manners. First, 
phthalimide affects the stability of Ni-aryl 
complexes by acting as a ligand that precludes 
decomposition pathways such as protodeha- 
logenation and aryl metathesis. This also ac- 
counts for the inclusion of nonactivated acids 
into the scope of this transformation, where 
the increased OAC lifetime counteracts the 
lower effective radical concentration inher- 
ent to these acids. Furthermore, when using 
electron-rich aryl bromides, the reaction under- 
goes reversible deactivation caused by a de- 
crease in the effective concentration of on-cycle 
Ni catalyst. This is presumably because of 
the formation of off-cycle, unreactive multi- 
meric species resulting from the aggregation 
of low-valent Ni complexes. Phthalimide is 
capable of reactivating inactive multimeric Ni 
species and thus increasing the concentration 
of catalytically active Ni, which allows the 
catalytic cycle to be productively turned over. 
We do not rule out further effects of phthal- 
imide in this reaction, and additional mecha- 
nistic investigations are currently ongoing. The 
effect of phthalimide on Ni-catalyzed cross- 
couplings in general is also undergoing fur- 
ther investigation. 

In summary, by identifying phthalimide as a 
beneficial additive for decarboxylative arylation, 
we were able to develop a general transfor-
mation to reliably form C(sp²)-C(sp³) bonds
from feedstock chemicals. This improvement 
was achieved in less than a year from project 
inception, which highlights the expedited 
nature of additive mapping. Furthermore, we 
then leveraged the discovered phthalimide 
effect to broaden the mechanistic understand- 
ing of Ni-catalyzed cross-coupling from an 
angle not accessible by traditional mechanistic 
studies, which highlights the orthogonality 
of this approach. We imagine that both the use 
of phthalimide as an additive and the herein 
reported approach for reaction generalization 
and mechanistic elucidation will be rapidly 
embraced to affect modern organic synthesis. 
Gloucester Marine Genomics Institute seeks candidates for the
position of Chief Scientific Officer based in Gloucester, MA 


GMGI has increased the prominence of its scientific research in marine 
genomics. Having established a world-class research program over the last 
eight years, GMGI is positioned for substantial growth and seeks a visionary 
scientific leader to lead the organization’s research with an integrated strate- 
gic vision for long-term success, and enable the recruitment of exceptional 
junior and senior research scientists. The CSO will consider the interplay of 
GMGI’s expertise in marine genetics and genomics, its research, educational 
and commercial strengths, as well as the evolving ecosystem of biomedical 
research. While honoring Gloucester’s unique culture and history, the CSO 
will drive the organization forward with insight and creativity to a bold vision 
of its future, while also leading with clarity and a strong “human” touch. 


The CSO will be responsible for the Institute’s overall scientific direction, 
establishing key strategic goals and setting organizational performance ex- 
pectations. The successful candidate will have an international reputation and 
well-funded research program. They will create the necessary environment, 
processes and infrastructure to discuss and make critical strategic, financial 
and scientific decisions. 


GMGI addresses critical challenges facing our oceans, human
health and the environment through innovative scientific research
and education. By bringing world-class science and transforma-
tive workforce development to Gloucester’s historic waterfront,
GMGI is catalyzing the regional economy. Founded in 2013, our work
includes a state-of-the-art research institute where we conduct marine
research powered by genomics, and the Gloucester Biotechnology Academy,
preparing young adults to become biotech lab technicians.


GMGI is proud to be an equal opportunity employer, and all aspects of
employment are based on merit, competence, performance, and business need.


Interested individuals submit materials to: recruiting@gmgi.org 


A career plan customized 
for you, by you. 


For your career in science, there’s only one Science


Features in myIDP include: 


• Exercises to help you examine your skills, interests,
and values.


• A list of 20 scientific career paths with a prediction
of which ones best fit your skills and interests.


Visit the website and start planning today!
myIDP.sciencecareers.org


ScienceCareers in partnership with: 


AAAS, FASEB, UMass Medical School, University of California, Burroughs Wellcome Fund


Science Careers helps you advance your career. Learn how!

• Register for a free online account on ScienceCareers.org.
• Search hundreds of job postings and find your perfect job.
• Sign up to receive e-mail alerts about job postings that match your criteria.
• Upload your resume into our database and connect with employers.
• Watch one of our many webinars on different career topics such as job searching, networking, and more.
• Download our career booklets, including Career Basics, Careers Beyond the Bench, and Developing Your Skills.
• Complete an interactive, personalized career plan at “my IDP.”
• Visit our Employer Profiles to learn more about prospective employers.
• Read relevant career advice articles from our library of thousands.

Visit ScienceCareers.org today. All resources are free.

Science Careers, from the journal Science


Tel: +86-10-56118687 Email: medtalent@mail.tsinghua.edu.cn


Beijing Tsinghua Changgung Hospital (BTCH), established on November 28, 2014 as a Tsinghua University-affiliated public
hospital, is a world-class academic medical center which integrates services, training and research alongside the mission to
promote health, cultivate outstanding physicians, and lead medical innovations while maintaining excellent management.

BTCH hosts 7 medical departments which include 52 specialized disciplines, offering comprehensive services from internal 
medicine to surgery, obstetrics-gynecology, pediatrics, emergency and intensive care, and family medicine. In addition to these, the
hospital has built renowned integrated centers including Hepato-Pancreato-Biliary Center, Neuro-Center, Digestive Disease Center, 
Emergency and Intensive Care Center and Cardiovascular Center. Our medical professionals are from prestigious backgrounds 
and provide diverse perspectives to medical practices internationally. This team is led by the inaugural president Jiahong Dong, an 
academician of the Chinese Academy of Engineering and an internationally recognized HPB and transplant surgeon. 


BTCH aims to provide health-oriented medical services which are guided by key tenets which give first priority to patients and 
operate with diligent doctors as its core foundation. We conceptualized the idea of precision healthcare, compassionate service 
and lean management, in order to visualize a modern healthcare service system which merges medical prevention, treatment, re- 
habilitation, chronic illness management and senior healthcare. Through an accelerated 7-year development, BTCH has become 
a comprehensive high-quality hospital with good hospital governance, a strong professional team, a pool of outstanding talents,
first-class medical quality and a good reputation in the industry.


Our services are grounded in cutting-edge medical technologies, evidence-based practices and standardized
care protocols with value-based care. Abiding by the belief of “Perform surgeries that others are unwilling to
risk, and cure diseases that others are incapable of,” our hospital has successfully treated a great number of
complicated and difficult cases from home and abroad by confronting medical problems in various specialized
fields. Furthermore, BTCH is dedicated to establishing a regional integrated healthcare consortium focusing on
the health of community residents, by consistently providing hierarchical, integrated medical care which
encompasses the whole population, all aspects of healthcare and one’s entire lifespan.


BTCH is devoted to cultivating outstanding future physician leaders through the incorporation of benevolent
healthcare practices, scientific aptitude and leadership. The hospital’s accredited residency and fellowship
programs are certified by the National Health Commission of China, with yearly assessments from rigorous
national-level examinations yielding scores ranking in the top 5th percentile. The hospital was awarded first
place in the 2019 annual national residency training examination and was designated one of the national
clinical training demonstration centers in 2018. Our system-oriented training programs foster health
professionals that adapt well to the changing landscape of healthcare.


As an initiative undertaken by BTCH, 
the Institute of Precision Medicine of Ts- 
inghua University serves as an interdis- 
ciplinary health science innovation plat- 
form, supporting 6 medical-engineering 
research centers and 15 interdisciplinary 
medical laboratories including those 
of big data, digital health, intelligent 
healthcare, medical robotics, wearable 
healthcare devices, ECMO, translational
medical imaging, regional integrated 
medical systems, photomedicine, liver 
metabolomics, precision pharmacy, etc. 


Guided by the precision healthcare theo-
ry, the Clinical-Driven Research (CDR)
paradigm and the Bedside to Bench to
Clients (BBC) model were established
to combine Tsinghua University’s ac-
ademic resources in clinical medicine,
science, engineering, public health, arts
and others for multidisciplinary collab-
oration. This platform culminates in
an innovation ecosystem that bridges
healthcare organizations, academia
and enterprises by translating basic
and clinical research into better and safer
healthcare practices and into products,
a number of which are currently either
in the later stages of clinical trials
or in use.


In following our “4I” development strategy that highlights Innovation, Integration, Individuation and Internationalization, we
welcome talented physicians, surgeons, scientists and other medical professionals of all nationalities and backgrounds to join our BTCH
family. We are committed to providing a superior platform, competitive benefits and a comfortable working environment for each
faculty member. Let’s convene with the common mission of creating a world-class center of excellence, cultivating outstanding
physicians, provoking cutting-edge medical innovations and achieving “Healthy China,” a healthful shared future for mankind.
Scientist and artist


“Would you say you are more of a scientist or an artist?” I frequently heard this question as I
was preparing to start my neuroscience doctoral program, often after I told scientific men-
tors and peers that I love writing poetry. I learned to respond, “A scientist,” not because
it was necessarily true, but because I sensed it would win me more respect. As an under-
graduate, poetry had been my therapy, my cathartic release. But in grad school, I wanted
to be seen as just as dedicated to my work as those around me, who appeared completely
undistracted by nonscientific pursuits. So, I left my poetry by the wayside, but it wasn’t long before I
felt a massive void in my life.


By the second year of my Ph.D., 
my thesis project had become 
all-consuming and I was start- 
ing to burn out. One afternoon, 
an undergraduate trainee who 
worked in the lab across the hall 
mentioned the weekly event held 
by the university’s poetry club, of 
which she was a leader. After hear- 
ing how much I enjoyed poetry, 
she declared, “You should come!” 
A few days later, I sat in a packed 
café, exulting in the warmth of the 
vibrant community. 

When I received another 
invitation—this time to perform 
my own work—I couldn’t say no. 
I rediscovered catharsis through 
storytelling, writing about work- 
place misogyny I had witnessed. 
As I prepared my poem, though, 
I was also awash with guilt about 
dedicating hours to writing and 
rehearsing, time I could have 
spent reading papers and analyzing data. Still, I finished 
my poem and showed up for my performance. With the 
outpouring of love I received that evening, I knew I’d 
be back. 

I continued to write and perform, but I still worried 
about appearing less than fully dedicated to my science. 
So, when a senior grad student in my lab whispered, “I 
hear you perform poetry!” I froze. Was my cover blown? 
My heart began to race. She went on to invite me to per- 
form at our department’s art gala—a talent show of sorts, 
showcasing faculty, staff, and trainees. “By the way, Cheryl 
will be performing!” she added. 

I was surprised to discover that Cheryl, our lab’s princi- 
pal investigator, was a dancer. I excitedly sought her out, 
and we spoke for more than an hour about how science 
and art can intertwine. She explained that my skill at pick-
ing up patterns in experimental
results was the same one I used 
to create rhymes in my poems. 
With Cheryl as a model, I de- 
cided to perform at the gala, for 
once not feeling I had anything 
to hide. 

The night of the gala, I mar- 
veled at the talent on display, 
including stunning musicianship 
and visual art. Chatting with the 
other performers, many of whom 
were also graduate students, I re- 
alized many shared my concern 
that peers might interpret a pas- 
sion for art as a lack of passion 
for science or, at the very least, a 
distraction. But over the course 
of the evening, I found myself at 
peace with the two halves of my 
identity, and I got the feeling oth- 
ers felt that same peace, too. 

Since then, I have continued to 
embrace all of me. While complet- 
ing my Ph.D. and moving on to a postdoc, I frequently scrib- 
bled lines of poetry on Post-its and Kimwipes and wrote 
poems during my lunch break. Some mentors and peers 
have seen the value of this outside interest; others have 
been skeptical. But when I have my own doubts, I think of 
my Ph.D. supervisor, and of renowned 19th-century neuro-
scientist Santiago Ramón y Cajal, whose beautiful drawings
of the central nervous system can be found in science text- 
books and art museums in equal measure. They remind me 
that art and science can be profoundly complementary. And 
now, if someone asks me whether I’m more of a scientist or 
an artist, I very proudly say, “Both.” 


Asma Bashir is a neuroscientist by training and host of the 
podcast Her Royal Science in Halifax, Canada. Send your career 
story to SciCareerEditor@aaas.org. 